File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Exploring the limit of using a deep neural network on pileup data for germline variant calling

TitleExploring the limit of using a deep neural network on pileup data for germline variant calling
Authors
Issue Date2020
PublisherNature Research (part of Springer Nature). The Journal's web site is located at https://www.nature.com/natmachintell/
Citation
Nature Machine Intelligence, 2020, v. 2 n. 4, p. 220-227 How to Cite?
AbstractSingle-molecule sequencing technologies have emerged in recent years and revolutionized structural variant calling, complex genome assembly and epigenetic mark detection. However, the lack of a highly accurate small variant caller has limited these technologies from being more widely used. Here, we present Clair, the successor to Clairvoyante, a program for fast and accurate germline small variant calling, using single-molecule sequencing data. For Oxford Nanopore Technology data, Clair achieves better precision, recall and speed than several competing programs, including Clairvoyante, Longshot and Medaka. Through studying the missed variants and benchmarking intentionally overfitted models, we found that Clair may be approaching the limit of possible accuracy for germline small variant calling using pileup data and deep neural networks. Clair requires only a conventional central processing unit (CPU) for variant calling and is an open-source project available at https://github.com/HKU-BAL/Clair.
Persistent Identifierhttp://hdl.handle.net/10722/282021
ISSN
2023 Impact Factor: 18.8
2023 SCImago Journal Rankings: 5.940
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLuo, R-
dc.contributor.authorWong, C-L-
dc.contributor.authorWong, Y-S-
dc.contributor.authorTang, C-I-
dc.contributor.authorLiu, C-M-
dc.contributor.authorLeung, C-M-
dc.contributor.authorLam, T-W-
dc.date.accessioned2020-04-19T03:34:16Z-
dc.date.available2020-04-19T03:34:16Z-
dc.date.issued2020-
dc.identifier.citationNature Machine Intelligence, 2020, v. 2 n. 4, p. 220-227-
dc.identifier.issn2522-5839-
dc.identifier.urihttp://hdl.handle.net/10722/282021-
dc.description.abstractSingle-molecule sequencing technologies have emerged in recent years and revolutionized structural variant calling, complex genome assembly and epigenetic mark detection. However, the lack of a highly accurate small variant caller has limited these technologies from being more widely used. Here, we present Clair, the successor to Clairvoyante, a program for fast and accurate germline small variant calling, using single-molecule sequencing data. For Oxford Nanopore Technology data, Clair achieves better precision, recall and speed than several competing programs, including Clairvoyante, Longshot and Medaka. Through studying the missed variants and benchmarking intentionally overfitted models, we found that Clair may be approaching the limit of possible accuracy for germline small variant calling using pileup data and deep neural networks. Clair requires only a conventional central processing unit (CPU) for variant calling and is an open-source project available at https://github.com/HKU-BAL/Clair.-
dc.languageeng-
dc.publisherNature Research (part of Springer Nature). The Journal's web site is located at https://www.nature.com/natmachintell/-
dc.relation.ispartofNature Machine Intelligence-
dc.rightsThis is a post-peer-review, pre-copyedit version of an article published in Nature Machine Intelligence. The final authenticated version is available online at: https://doi.org/10.1038/s42256-020-0167-4-
dc.titleExploring the limit of using a deep neural network on pileup data for germline variant calling-
dc.typeArticle-
dc.identifier.emailLuo, R: rbluo@cs.hku.hk-
dc.identifier.emailWong, C-L: clacm@hku.hk-
dc.identifier.emailWong, Y-S: ncls935@hku.hk-
dc.identifier.emailLiu, C-M: imcx@hku.hk-
dc.identifier.emailLeung, C-M: cmleung3@hku.hk-
dc.identifier.emailLam, T-W: twlam@cs.hku.hk-
dc.identifier.authorityLuo, R=rp02360-
dc.identifier.authorityLeung, C-M=rp00144-
dc.identifier.authorityLam, T-W=rp00135-
dc.description.naturepreprint-
dc.identifier.doi10.1038/s42256-020-0167-4-
dc.identifier.hkuros309705-
dc.identifier.volume2-
dc.identifier.issue4-
dc.identifier.spage220-
dc.identifier.epage227-
dc.identifier.eissn2522-5839-
dc.identifier.isiWOS:000571260800006-
dc.publisher.placeUnited Kingdom-
dc.identifier.issnl2522-5839-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats