A genomic database for the Chinese population with applications to precision medicine


Grant Data
Project Title
A genomic database for the Chinese population with applications to precision medicine
Principal Investigator
Professor Lam, Tak Wah   (Project coordinator)
Co-Investigator(s)
Dr Chan Tsun Leung   (Co-Investigator)
Mr Luo Ruibang   (Co-Investigator)
Professor Cheung David Wai Lok   (Co-Investigator)
Duration
24
Start Date
2015-12-01
Completion Date
2017-11-30
Amount
6804550
Conference Title
Presentation Title
Keywords
genomic database, Chinese population, precision medicine
Discipline
Others - Computing Science and Information Technology
Panel
Engineering
HKU Project Code
ITS/155/15FP
Grant Type
Innovation and Technology Support Programme (Tier 2)
Funding Year
2015/2016
Status
On-going
Objectives
Data availability is a determining issue. The two biggest sequencing providers in China, BGI and Novogene, own over 80% of the Chinese sequencing data worldwide. They have agreed to provide this project with all their available data (~150,000 individuals). The raw data have hundreds of petabytes, with varying formats, quality and associated information. Data standardization and unified re-analysis are required before we construct a genomic database. We partner with TianHe-2 supercomputer for the needed computing resources. We will design a novel storage and query scheme for the genomic database. The objective is to enable the database to represent precise distribution of genetic variations in the population. This genomic database is useful to many sectors, including clinical practitioners for disease diagnosis and prescription, academic researchers for knowledge discovery, applications in personal genomics and pharmaceutical industry, etc. We will conduct pilot studies with our industry partners, including Hong Kong Sanatorium & Hospital to validate the project deliverables