MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)

LUO, R; CHAN, SH; LAW, WC; He, G; Yu, C; LIU, CM; Zhou, D; Li, Y; Li, R; Wang, J; Zhu, X; Peng, S; Lam, TW

File Download

content.pdf

Links for fulltext

(May Require Subscription)

Publisher Website: 10.1186/1471-2105-16-S7-S10
Scopus: eid_2-s2.0-84977534075
PMID: 25952019
WOS: WOS:000353976600010
Find via

Supplementary

Citations:
- Scopus: 0
- Web of Science: 0
- PubMed Central: 0
Appears in Collections:
- Computer Science: Journal/Magazine Articles

Article: MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)

Title	MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)
Authors	LUO, R CHAN, SH LAW, WC He, G Yu, C LIU, CM Zhou, D Li, Y Li, R Wang, J Zhu, X Peng, S Lam, TW
Issue Date	2015
Citation	BMC Bioinformatics, 2015, v. 16 n. suppl. 7, article no. S10 How to Cite? DOI: http://dx.doi.org/10.1186/1471-2105-16-S7-S10
Abstract	Background: Short-read aligners have recently gained a lot of speed by exploiting the massive parallelism of GPU. An uprising alterative to GPU is Intel MIC; supercomputers like Tianhe-2, currently top of TOP500, is built with 48,000 MIC boards to offer ~55 PFLOPS. The CPU-like architecture of MIC allows CPU-based software to be parallelized easily; however, the performance is often inferior to GPU counterparts as an MIC card contains only ~60 cores (while a GPU card typically has over a thousand cores). Results: To better utilize MIC-enabled computers for NGS data analysis, we developed a new short-read aligner MICA that is optimized in view of MIC's limitation and the extra parallelism inside each MIC core. By utilizing the 512-bit vector units in the MIC and implementing a new seeding strategy, experiments on aligning 150 bp paired-end reads show that MICA using one MIC card is 4.9 times faster than BWA-MEM (using 6 cores of a top-end CPU), and slightly faster than SOAP3-dp (using a GPU). Furthermore, MICA's simplicity allows very efficient scale-up when multiple MIC cards are used in a node (3 cards give a 14.1-fold speedup over BWA-MEM). Summary: MICA can be readily used by MIC-enabled supercomputers for production purpose. We have tested MICA on Tianhe-2 with 90 WGS samples (17.47 Tera-bases), which can be aligned in an hour using 400 nodes. MICA has impressive performance even though MIC is only in its initial stage of development. Availability and implementation: MICA's source code is freely available at http://sourceforge.net/projects/mica-aligner under GPL v3. Supplementary information: Supplementary information is available as "Additional File 1". Datasets are available at www.bio8.cs.hku.hk/dataset/mica.
Persistent Identifier	http://hdl.handle.net/10722/215514
ISSN	1471-2105 2021 Impact Factor: 3.307 2020 SCImago Journal Rankings: 1.567
ISI Accession Number ID	WOS:000353976600010

DC Field	Value	Language
dc.contributor.author	LUO, R	-
dc.contributor.author	CHAN, SH	-
dc.contributor.author	LAW, WC	-
dc.contributor.author	He, G	-
dc.contributor.author	Yu, C	-
dc.contributor.author	LIU, CM	-
dc.contributor.author	Zhou, D	-
dc.contributor.author	Li, Y	-
dc.contributor.author	Li, R	-
dc.contributor.author	Wang, J	-
dc.contributor.author	Zhu, X	-
dc.contributor.author	Peng, S	-
dc.contributor.author	Lam, TW	-
dc.date.accessioned	2015-08-21T13:28:36Z	-
dc.date.available	2015-08-21T13:28:36Z	-
dc.date.issued	2015	-
dc.identifier.citation	BMC Bioinformatics, 2015, v. 16 n. suppl. 7, article no. S10	-
dc.identifier.issn	1471-2105	-
dc.identifier.uri	http://hdl.handle.net/10722/215514	-
dc.description.abstract	Background: Short-read aligners have recently gained a lot of speed by exploiting the massive parallelism of GPU. An uprising alterative to GPU is Intel MIC; supercomputers like Tianhe-2, currently top of TOP500, is built with 48,000 MIC boards to offer ~55 PFLOPS. The CPU-like architecture of MIC allows CPU-based software to be parallelized easily; however, the performance is often inferior to GPU counterparts as an MIC card contains only ~60 cores (while a GPU card typically has over a thousand cores). Results: To better utilize MIC-enabled computers for NGS data analysis, we developed a new short-read aligner MICA that is optimized in view of MIC's limitation and the extra parallelism inside each MIC core. By utilizing the 512-bit vector units in the MIC and implementing a new seeding strategy, experiments on aligning 150 bp paired-end reads show that MICA using one MIC card is 4.9 times faster than BWA-MEM (using 6 cores of a top-end CPU), and slightly faster than SOAP3-dp (using a GPU). Furthermore, MICA's simplicity allows very efficient scale-up when multiple MIC cards are used in a node (3 cards give a 14.1-fold speedup over BWA-MEM). Summary: MICA can be readily used by MIC-enabled supercomputers for production purpose. We have tested MICA on Tianhe-2 with 90 WGS samples (17.47 Tera-bases), which can be aligned in an hour using 400 nodes. MICA has impressive performance even though MIC is only in its initial stage of development. Availability and implementation: MICA's source code is freely available at http://sourceforge.net/projects/mica-aligner under GPL v3. Supplementary information: Supplementary information is available as "Additional File 1". Datasets are available at www.bio8.cs.hku.hk/dataset/mica.	-
dc.language	eng	-
dc.relation.ispartof	BMC Bioinformatics	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.title	MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)	-
dc.type	Article	-
dc.identifier.email	He, G: gzhe2@HKUCC-COM.hku.hk	-
dc.identifier.email	Lam, TW: twlam@cs.hku.hk	-
dc.identifier.authority	Lam, TW=rp00135	-
dc.description.nature	published_or_final_version	-
dc.identifier.doi	10.1186/1471-2105-16-S7-S10	-
dc.identifier.pmid	25952019	-
dc.identifier.scopus	eid_2-s2.0-84977534075	-
dc.identifier.hkuros	248453	-
dc.identifier.volume	16	-
dc.identifier.issue	suppl. 7	-
dc.identifier.spage	article no. S10	-
dc.identifier.epage	article no. S10	-
dc.identifier.isi	WOS:000353976600010	-
dc.identifier.issnl	1471-2105	-

File Download

Links for fulltext

(May Require Subscription)

Supplementary

Article: MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC)

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats