Logan's mission: democratize access to Earth’s genome
Earth’s genetic diversity is a heritage of humanity. For 15+ years, next-generation sequencing has accumulated petabytes (millions of gigabytes) of data across tens of millions of datasets, giving us a glimpse of the whole-Earth genome. This data is stored at the Sequence Read Archive (SRA), the world's largest public repository of sequencing data. The SRA is growing exponentially, far outpacing the capacity of standard tools such as BLAST, and is thus largely inaccessible for most practical purposes.
We aim to open up this trove of genetic data via two strategies: assembly and search.
📋 Logan assembly
Logan contigs on AWS.
🔎 Logan online search
https://logan-search.org.
Logan snapshot 240101
Covers 27.8 million SRA datasets totalling ~5 × 10¹⁶ bases.
Paper
Chikhi, Rayan, et al. "Logan: planetary-scale genome assembly surveys life’s diversity." bioRxiv (2024): 2024-07.