Haplotyping of full-length transcript reads from long-read sequencing can reveal allelic imbalances in isoform expression

Author(s): Tseng, E., and Smith, T., and Kingan, S., and Hiendleder, S., and Liu, C., and Williams, J.

The Pacific Biosciences Iso-Seq method, which can produce high-quality isoform sequences of 10 kb and longer, has been used to annotate many important plant and animal genomes. Here, we develop an algorithm called IsoPhase that postprocesses Iso-Seq data to retrieve allele specific isoform information. Using simulated data, we show that for both diploid and tetraploid genomes, IsoPhase results in good SNP recovery with low FDR at error rates consistent with CCS reads. We apply IsoPhase to a haplotyperesolved genome assembly and multiple fetal tissue Iso-Seq dataset from a F1 cross of Angus x Brahman cattle subspecies. IsoPhase-called haplotypes were validated by the phased assembly and demonstrate the potential for revealing allelic imbalances in isoform expression.

Organization: PacBio
Year: 2018

View Conference Poster




在本网页上注册,即表示您同意,并同意 PacBio 根据我们的隐私政策收集和使用该信息.