File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

postgraduate thesis: HLA typing from next generation sequencing data : strategies, methods, and applications

TitleHLA typing from next generation sequencing data : strategies, methods, and applications
Authors
Issue Date2015
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Huang, Y. [黄亚志]. (2015). HLA typing from next generation sequencing data : strategies, methods, and applications. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5736651.
AbstractHLA genes play a key role in the human immune system. As a result, HLA typing is very important for both clinical laboratories and biomedical research. However, HLA typing has always been challenging due to the complexity of this group of genes, including existence of large number of alleles for most HLA genes, major sequence difference between these alleles, sequence similarity among the paralogous HLA genes, and long range linkage disequilibrium (LD) in this region. With the development of NGS, large amount of sequencing data are becoming widely available. Although most of them were not generated for this purpose, they still provide valuable resources for HLA typing. NGS data might be useful in many aspects such as preliminary screening for potential organ donors, for individuals that are potentially susceptible to adverse drug responses, risk prediction for complex diseases, and population genetic studies. However, due to the complexity of the HLA loci, the large amount of NGS data has not been made informative on HLA genotypes. Many efforts have been made on HLA typing by mining NGS data, including the alignment-based method that relies on counting the number of short reads aligned to each specific allele, the assembly- and scoring-based method that takes into account of good quality contigs and their scores for each candidate HLA allele. These methods capitalize on the increasing accessibility and affordability of NGS sequencing and have greatly reduced the time and cost required to make an HLA call comparing to traditional standard PCR-based solutions. Unfortunately, these methods are only capable of achieving low-digit resolution and perform poorly at higher-digit resolution, which is required for clinical applications. In this study, I introduce a novel approach for accurate HLA typing at high-digit resolution based on a strategy of comparing sequence reads to a comprehensive reference panel containing all the known HLA alleles for high efficiency mapping, followed by assembly of the mapped reads to contigs, stepwise matching and designation of the contigs to HLA alleles and decision on HLA allele calling. Testing of the method on a set of public and internal whole exome sequencing (WES) data demonstrated that this new method is capable of reporting HLA alleles at a high-digit resolution with great accuracy. I also conducted a preliminary analysis of WES data from a set of NGS samples generated in-house. HLA calling results demonstrated consistent allele frequencies to those recorded in the Allele Frequency Net Database (AFND) of the same population. In addition, I also used the typing results from the NGS samples to design population-specific primers and probes. Finally, two set of samples with Crohn’s disease and tuberculosis were studied using this method. Results showed that the previous findings related to HLA and these diseases were effectively replicated in our dataset using the proposed method. Certain interesting results that were not yet reported were also observed in our dataset. These preliminary results highlighted the potential applications of this method for HLA calling from NGS data.
DegreeDoctor of Philosophy
SubjectHLA histocompatibility antigens
Nucleotide sequence
Dept/ProgramPaediatrics and Adolescent Medicine
Persistent Identifierhttp://hdl.handle.net/10722/235792
HKU Library Item IDb5736651

 

DC FieldValueLanguage
dc.contributor.authorHuang, Yazhi-
dc.contributor.author黄亚志-
dc.date.accessioned2016-10-21T23:26:02Z-
dc.date.available2016-10-21T23:26:02Z-
dc.date.issued2015-
dc.identifier.citationHuang, Y. [黄亚志]. (2015). HLA typing from next generation sequencing data : strategies, methods, and applications. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. Retrieved from http://dx.doi.org/10.5353/th_b5736651.-
dc.identifier.urihttp://hdl.handle.net/10722/235792-
dc.description.abstractHLA genes play a key role in the human immune system. As a result, HLA typing is very important for both clinical laboratories and biomedical research. However, HLA typing has always been challenging due to the complexity of this group of genes, including existence of large number of alleles for most HLA genes, major sequence difference between these alleles, sequence similarity among the paralogous HLA genes, and long range linkage disequilibrium (LD) in this region. With the development of NGS, large amount of sequencing data are becoming widely available. Although most of them were not generated for this purpose, they still provide valuable resources for HLA typing. NGS data might be useful in many aspects such as preliminary screening for potential organ donors, for individuals that are potentially susceptible to adverse drug responses, risk prediction for complex diseases, and population genetic studies. However, due to the complexity of the HLA loci, the large amount of NGS data has not been made informative on HLA genotypes. Many efforts have been made on HLA typing by mining NGS data, including the alignment-based method that relies on counting the number of short reads aligned to each specific allele, the assembly- and scoring-based method that takes into account of good quality contigs and their scores for each candidate HLA allele. These methods capitalize on the increasing accessibility and affordability of NGS sequencing and have greatly reduced the time and cost required to make an HLA call comparing to traditional standard PCR-based solutions. Unfortunately, these methods are only capable of achieving low-digit resolution and perform poorly at higher-digit resolution, which is required for clinical applications. In this study, I introduce a novel approach for accurate HLA typing at high-digit resolution based on a strategy of comparing sequence reads to a comprehensive reference panel containing all the known HLA alleles for high efficiency mapping, followed by assembly of the mapped reads to contigs, stepwise matching and designation of the contigs to HLA alleles and decision on HLA allele calling. Testing of the method on a set of public and internal whole exome sequencing (WES) data demonstrated that this new method is capable of reporting HLA alleles at a high-digit resolution with great accuracy. I also conducted a preliminary analysis of WES data from a set of NGS samples generated in-house. HLA calling results demonstrated consistent allele frequencies to those recorded in the Allele Frequency Net Database (AFND) of the same population. In addition, I also used the typing results from the NGS samples to design population-specific primers and probes. Finally, two set of samples with Crohn’s disease and tuberculosis were studied using this method. Results showed that the previous findings related to HLA and these diseases were effectively replicated in our dataset using the proposed method. Certain interesting results that were not yet reported were also observed in our dataset. These preliminary results highlighted the potential applications of this method for HLA calling from NGS data.-
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshHLA histocompatibility antigens-
dc.subject.lcshNucleotide sequence-
dc.titleHLA typing from next generation sequencing data : strategies, methods, and applications-
dc.typePG_Thesis-
dc.identifier.hkulb5736651-
dc.description.thesisnameDoctor of Philosophy-
dc.description.thesislevelDoctoral-
dc.description.thesisdisciplinePaediatrics and Adolescent Medicine-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.5353/th_b5736651-
dc.identifier.mmsid991019345039703414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats