Genome Informatics Section

Our section develops and applies computational methods for the analysis of massive genomics datasets, focusing on the challenges of genome sequencing and comparative genomics. We aim to improve such foundational processes and translate emerging genomic technologies into practice.


The (near) complete sequence of a human genome

September 22, 2020

The Telomere-to-Telomere (T2T) consortium is proud to announce our v1.0 assembly of a complete human genome. This post briefly summarizes our work over the past year, including a month-long virtual workshop in June, as we strove to complete as many human chromosomes as possible. Our progress over the summer exceeded our wildest expectations and resulted in the completion of all human chromosomes, with the only exception being the 5 rDNA arrays. Our v1.0 assembly includes more than 100 Mbp of novel sequence compared to GRCh38, achieves near-perfect sequence accuracy, and unlocks the most complex regions of the genome to functional study. We plan to release a series of preprints in the coming months that fully describe our methods and analyses, but due to its tremendous value, we are releasing the assembly immediately.

We are hiring!

September 21, 2020

[Update December 17, 2020: these positions have now been filled] The Genome Informatics Section is hiring! Come join our outstanding team at the NIH’s National Human Genome Research Institute and contribute to the development of new reference genomes and computational methods for DNA sequencing and analysis. Both postdoc and engineering positions are available. More information and application instructions follow below.

The genomic structure of a human chromosome 22 nucleolar organizer region determined by TAR cloning
Scientific reports, February 4, 2021
Kim JH, Noskov VN, Ogurtsov AY, Nagaraja R, Petrov N, Liskovykh M, Walenz BP, Lee HS, Kouprina N, Phillippy AM, Shabalina SA, Schlessinger D, Larionov V


A single molecule sequence assembler for genomes large and small


Fast genome and metagenome distance and containment estimation using MinHash


Interactively explore metagenomes and more from a web browser