File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: FlowGrid enables fast clustering of very large single-cell RNA-seq data

TitleFlowGrid enables fast clustering of very large single-cell RNA-seq data
Authors
Issue Date2022
PublisherOxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/
Citation
Bioinformatics, 2022, v. 38 n. 1, p. 282-283 How to Cite?
AbstractMotivation: Scalable clustering algorithms are needed to analyze millions of cells in single cell RNA-seq (scRNA-seq) data. Results: Here, we present an open source python package called FlowGrid that can integrate into the Scanpy workflow to perform clustering on very large scRNA-seq datasets. FlowGrid implements a fast density-based clustering algorithm originally designed for flow cytometry data analysis. We introduce a new automated parameter tuning procedure, and show that FlowGrid can achieve comparable clustering accuracy as state-of-the-art clustering algorithms but at a substantially reduced run time for very large single cell RNA-seq datasets. For example, FlowGrid can complete a one-hour clustering task for one million cells in about five min. Availability and implementation: https://github.com/holab-hku/FlowGrid. Supplementary information: Supplementary data are available at Bioinformatics online.
Persistent Identifierhttp://hdl.handle.net/10722/301912
ISSN
2021 Impact Factor: 6.931
2020 SCImago Journal Rankings: 3.599
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorFang, X-
dc.contributor.authorHo, JWK-
dc.date.accessioned2021-08-21T03:28:49Z-
dc.date.available2021-08-21T03:28:49Z-
dc.date.issued2022-
dc.identifier.citationBioinformatics, 2022, v. 38 n. 1, p. 282-283-
dc.identifier.issn1367-4803-
dc.identifier.urihttp://hdl.handle.net/10722/301912-
dc.description.abstractMotivation: Scalable clustering algorithms are needed to analyze millions of cells in single cell RNA-seq (scRNA-seq) data. Results: Here, we present an open source python package called FlowGrid that can integrate into the Scanpy workflow to perform clustering on very large scRNA-seq datasets. FlowGrid implements a fast density-based clustering algorithm originally designed for flow cytometry data analysis. We introduce a new automated parameter tuning procedure, and show that FlowGrid can achieve comparable clustering accuracy as state-of-the-art clustering algorithms but at a substantially reduced run time for very large single cell RNA-seq datasets. For example, FlowGrid can complete a one-hour clustering task for one million cells in about five min. Availability and implementation: https://github.com/holab-hku/FlowGrid. Supplementary information: Supplementary data are available at Bioinformatics online.-
dc.languageeng-
dc.publisherOxford University Press. The Journal's web site is located at http://bioinformatics.oxfordjournals.org/-
dc.relation.ispartofBioinformatics-
dc.titleFlowGrid enables fast clustering of very large single-cell RNA-seq data-
dc.typeArticle-
dc.identifier.emailHo, JWK: jwkho@hku.hk-
dc.identifier.authorityHo, JWK=rp02436-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1093/bioinformatics/btab521-
dc.identifier.pmid34289014-
dc.identifier.hkuros324193-
dc.identifier.volume38-
dc.identifier.issue1-
dc.identifier.spage282-
dc.identifier.epage283-
dc.identifier.isiWOS:000736120000045-
dc.publisher.placeUnited Kingdom-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats