File Download

There are no files associated with this item.

Supplementary

Article: Clair3-Trio: high-performance Nanopore long-read variant calling in family trios with Trio-to-Trio deep neural networks

TitleClair3-Trio: high-performance Nanopore long-read variant calling in family trios with Trio-to-Trio deep neural networks
Authors
Issue Date2022
Citation
Briefing in Bioinformatics, 2022 How to Cite?
AbstractAccurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio’s predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio.
Persistent Identifierhttp://hdl.handle.net/10722/316297

 

DC FieldValueLanguage
dc.contributor.authorSU, J-
dc.contributor.authorZHENG, Z-
dc.contributor.authorAhmed, SS-
dc.contributor.authorLam, TW-
dc.contributor.authorLuo, R-
dc.date.accessioned2022-09-02T06:09:00Z-
dc.date.available2022-09-02T06:09:00Z-
dc.date.issued2022-
dc.identifier.citationBriefing in Bioinformatics, 2022-
dc.identifier.urihttp://hdl.handle.net/10722/316297-
dc.description.abstractAccurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio’s predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio.-
dc.languageeng-
dc.relation.ispartofBriefing in Bioinformatics-
dc.titleClair3-Trio: high-performance Nanopore long-read variant calling in family trios with Trio-to-Trio deep neural networks-
dc.typeArticle-
dc.identifier.emailAhmed, SS: ssahmed@hku.hk-
dc.identifier.emailLam, TW: twlam@cs.hku.hk-
dc.identifier.emailLuo, R: rbluo@cs.hku.hk-
dc.identifier.authorityLam, TW=rp00135-
dc.identifier.authorityLuo, R=rp02360-
dc.identifier.hkuros336224-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats