File Download
Supplementary

postgraduate thesis: SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder

TitleSNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder
Authors
Issue Date2025
PublisherThe University of Hong Kong (Pokfulam, Hong Kong)
Citation
Chung, H. M. [鍾凱文]. (2025). SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
AbstractRecently, single-cell lineage tracing experiments using random passenger DNA mutations as genetic markers become popular, because they are non-invasive and can be applied to many biological samples without biochemical intervention. However, SNV clone assignment and lineage inference in this type of data remain a challenge due to hierarchical mutation structure and many missing signals. Existing SNV analysis methods were mainly designed for simpler mutation patterns with low covariance in different SNPs, so they do not work well in lineage tracing data with high covariance in different SNPs due to hierarchical evolutionary history. Applying them to lineage tracing data easily results in abnormal convergence and inaccurate conclusions. To solve this problem, I developed SNPmanifold, a Python package that learns an embedding manifold with a better cell-cell distance metric using a shallow binomial variational autoencoder. Based on this better cell-cell distance metric, SNPmanifold can avoid numerical issues brought by high covariance in different SNPs and assign SNV clones and lineages more accurately based on neighboring cells. Also, visualization of hierarchical mutation structure becomes clear under UMAP with this better cell-cell distance metric. In this thesis, I demonstrated that SNPmanifold can effectively identify a large number of multiplexed donors of origin (k = 18) with nuclear SNPs that all existing unsupervised methods fail, and lineages of mitochondrial somatic clones with meaningful biological interpretations such as agreements with cell types, CNV clones, and TRB clonotypes. In short, SNPmanifold is a better SNV analysis method for high-covariance single-cell lineage tracing data that I developed in this thesis.
DegreeMaster of Philosophy
SubjectCytology
Cells - Analysis
Machine learning
Bioinformatics
Dept/ProgramBiomedical Sciences
Persistent Identifierhttp://hdl.handle.net/10722/364024

 

DC FieldValueLanguage
dc.contributor.authorChung, Hoi Man-
dc.contributor.author鍾凱文-
dc.date.accessioned2025-10-20T02:56:36Z-
dc.date.available2025-10-20T02:56:36Z-
dc.date.issued2025-
dc.identifier.citationChung, H. M. [鍾凱文]. (2025). SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.-
dc.identifier.urihttp://hdl.handle.net/10722/364024-
dc.description.abstractRecently, single-cell lineage tracing experiments using random passenger DNA mutations as genetic markers become popular, because they are non-invasive and can be applied to many biological samples without biochemical intervention. However, SNV clone assignment and lineage inference in this type of data remain a challenge due to hierarchical mutation structure and many missing signals. Existing SNV analysis methods were mainly designed for simpler mutation patterns with low covariance in different SNPs, so they do not work well in lineage tracing data with high covariance in different SNPs due to hierarchical evolutionary history. Applying them to lineage tracing data easily results in abnormal convergence and inaccurate conclusions. To solve this problem, I developed SNPmanifold, a Python package that learns an embedding manifold with a better cell-cell distance metric using a shallow binomial variational autoencoder. Based on this better cell-cell distance metric, SNPmanifold can avoid numerical issues brought by high covariance in different SNPs and assign SNV clones and lineages more accurately based on neighboring cells. Also, visualization of hierarchical mutation structure becomes clear under UMAP with this better cell-cell distance metric. In this thesis, I demonstrated that SNPmanifold can effectively identify a large number of multiplexed donors of origin (k = 18) with nuclear SNPs that all existing unsupervised methods fail, and lineages of mitochondrial somatic clones with meaningful biological interpretations such as agreements with cell types, CNV clones, and TRB clonotypes. In short, SNPmanifold is a better SNV analysis method for high-covariance single-cell lineage tracing data that I developed in this thesis.en
dc.languageeng-
dc.publisherThe University of Hong Kong (Pokfulam, Hong Kong)-
dc.relation.ispartofHKU Theses Online (HKUTO)-
dc.rightsThe author retains all proprietary rights, (such as patent rights) and the right to use in future works.-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subject.lcshCytology-
dc.subject.lcshCells - Analysis-
dc.subject.lcshMachine learning-
dc.subject.lcshBioinformatics-
dc.titleSNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder-
dc.typePG_Thesis-
dc.description.thesisnameMaster of Philosophy-
dc.description.thesislevelMaster-
dc.description.thesisdisciplineBiomedical Sciences-
dc.description.naturepublished_or_final_version-
dc.date.hkucongregation2025-
dc.identifier.mmsid991045117393403414-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats