File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Repun: an accurate small variant representation unification method for multiple sequencing platforms

TitleRepun: an accurate small variant representation unification method for multiple sequencing platforms
Authors
Keywordshaplotype comparison
multiple platform sequencing
representation unification
variant calling
variant representation
Issue Date1-Jan-2025
PublisherOxford University Press
Citation
Briefings in Bioinformatics, 2025, v. 26, n. 1 How to Cite?
AbstractEnsuring a unified variant representation aligning the sequencing data is critical for downstream analysis as variant representation may differ across platforms and sequencing conditions. Current approaches typically treat variant unification as a post-step following variant calling and are incapable of measuring the correct variant representation from the outset. Aligning variant representations with the alignment before variant calling has benefits like providing reliable training labels for deep learning-based variant caller model training and enabling direct assessment of alignment quality. However, it also poses challenges due to the large number of candidates to handle. Here, we present Repun, a haplotype-aware variant-alignment unification algorithm that harmonizes the variant representation between provided variants and alignments in different sequencing platforms. Repun leverages phasing to facilitate equivalent haplotype matches between variants and alignments. Our approach reduced the comparisons between variant haplotypes and candidate haplotypes by utilizing haplotypes with read evidence to speed up the unification process. Repun achieved >99.99% precision and > 99.5% recall through extensive evaluations of various Genome in a Bottle Consortium samples encompassing three sequencing platforms: Oxford Nanopore Technology, Pacific Biosciences, and Illumina. Repun is open-source and available at (https://github.com/zhengzhenxian/Repun).
Persistent Identifierhttp://hdl.handle.net/10722/355822
ISSN
2023 Impact Factor: 6.8
2023 SCImago Journal Rankings: 2.143
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorZheng, Zhenxian-
dc.contributor.authorRen, Yingxuan-
dc.contributor.authorChen, Lei-
dc.contributor.authorWong, Angel On Ki-
dc.contributor.authorLi, Shumin-
dc.contributor.authorYu, Xian-
dc.contributor.authorLam, Tak Wah-
dc.contributor.authorLuo, Ruibang-
dc.date.accessioned2025-05-17T00:35:18Z-
dc.date.available2025-05-17T00:35:18Z-
dc.date.issued2025-01-01-
dc.identifier.citationBriefings in Bioinformatics, 2025, v. 26, n. 1-
dc.identifier.issn1467-5463-
dc.identifier.urihttp://hdl.handle.net/10722/355822-
dc.description.abstractEnsuring a unified variant representation aligning the sequencing data is critical for downstream analysis as variant representation may differ across platforms and sequencing conditions. Current approaches typically treat variant unification as a post-step following variant calling and are incapable of measuring the correct variant representation from the outset. Aligning variant representations with the alignment before variant calling has benefits like providing reliable training labels for deep learning-based variant caller model training and enabling direct assessment of alignment quality. However, it also poses challenges due to the large number of candidates to handle. Here, we present Repun, a haplotype-aware variant-alignment unification algorithm that harmonizes the variant representation between provided variants and alignments in different sequencing platforms. Repun leverages phasing to facilitate equivalent haplotype matches between variants and alignments. Our approach reduced the comparisons between variant haplotypes and candidate haplotypes by utilizing haplotypes with read evidence to speed up the unification process. Repun achieved >99.99% precision and > 99.5% recall through extensive evaluations of various Genome in a Bottle Consortium samples encompassing three sequencing platforms: Oxford Nanopore Technology, Pacific Biosciences, and Illumina. Repun is open-source and available at (https://github.com/zhengzhenxian/Repun).-
dc.languageeng-
dc.publisherOxford University Press-
dc.relation.ispartofBriefings in Bioinformatics-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjecthaplotype comparison-
dc.subjectmultiple platform sequencing-
dc.subjectrepresentation unification-
dc.subjectvariant calling-
dc.subjectvariant representation-
dc.titleRepun: an accurate small variant representation unification method for multiple sequencing platforms-
dc.typeArticle-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.1093/bib/bbae613-
dc.identifier.pmid39584701-
dc.identifier.scopuseid_2-s2.0-85210549310-
dc.identifier.volume26-
dc.identifier.issue1-
dc.identifier.eissn1477-4054-
dc.identifier.isiWOS:001363204200001-
dc.identifier.issnl1467-5463-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats