Phased diploid genome assembly with single-molecule real-time sequencing

Author(s): Concepcion, G. and Chin, C. S. and Peluso, P. and Sedlazeck, F. J. and Nattestad, M. and Clum, A. and Dunn, C. and O'Malley, R. and Figueroa-Balderas, F. and Morales-Cruz, A. and Cramer, G. R. and Delledonne, M. and Luo, C. and Ecker, J. R. and Cantu, D. and Rank, D. R. and Schatz, M. C.

While genome assembly projects have been successful in many haploid and inbred species, the assembly of non-inbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms ( to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternate short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.

Organization: PacBio
Year: 2017

View Conference Poster




在本网页上注册,即表示您同意,并同意 PacBio 根据我们的隐私政策收集和使用该信息.