Genome Informatics Section

Our section develops and applies computational methods for the analysis of massive genomics datasets, focusing on the challenges of genome sequencing and comparative genomics. We aim to improve such foundational processes and translate emerging genomic technologies into practice.

People

News

Choose your reference wisely

October 13, 2025

In honor of ASHG week, see “Choose your human genome reference wisely”(no paywall), in which Vivien Marx interviewed me on the state of the human reference genome. Vivien is always fun to chat with and I was in a slightly opinionated mood from the start — “The idea of a single reference genome is outdated,” says NIH researcher Adam Phillippy. Some of my other quotes follow with a little added context.

[ … entire post ]

The formation and propagation of human Robertsonian chromosomes

September 24, 2025

🚂 The T2T train keeps rolling. Our latest work investigating the structure and cause of Robertsonian chromosomes with the Gerton and Garrison labs is out today in the journal Nature! What’s a Robertsonian chromosome and why do they matter? More info below the break…

[ … entire post ]

The Q100 preprint!

September 22, 2025

We are delighted to finally announce a preprint describing the Q100 project, where we finished the HG002 genome to near-perfect accuracy: A complete diploid human genome benchmark for personalized genomics

[ … entire post ]

Publications

see all 250 publications

Biobank-scale genotyping of Robertsonian translocations reveals hidden structural variation on the human acrocentric chromosomes

bioRxiv, March 10, 2026

Full text

Rhie A, Kim J, Rodriguez-Algarra F, Solar S, Koren S, Antipov D, Wilczewski CM, Maxwell GL, Gerton J, Paschall J, Potapova T, Wolfsberg TG, Singh S, del Castillo del Rio SO, Human Pangenome Reference Consortium, Turner C, Rakyan VK, Phillippy AM

Automatic Generation of Model Sequences for Complex Regions in Assembly Graphs

bioRxiv, March 10, 2026

Full text

Antipov D, Chen Y, Sollitto M, Phillippy AM, Formenti G, Koren S

The complete genome of the KOLF2.1J reference iPSC line

bioRxiv, March 9, 2026

Full text

Alvarez Jerez P, Rhie A, Kim J … Antipov D, Koren S … Hansen NF … Phillippy AM, Blauwendraat C

Software

see all 23 projects

Verkko

Verkko is a hybrid genome assembly pipeline developed for telomere-to-telomere assembly of long accurate (e.g. PacBio HiFi) and ultra-long (e.g. Oxford Nanopore UL) reads. Verkko is Finnish for net, mesh and graph.

Mash

Fast genome and metagenome distance and containment estimation using MinHash

Krona

Interactively explore metagenomes and more from a web browser

Merqury

Evaluate genome assemblies with k-mers and more

ModDotPlot

ModDotPlot is a dotplot visualization tool, built to quickly visualize large complex repeats within whole chromosomes.