File Download
Supplementary
-
Citations:
- Appears in Collections:
postgraduate thesis: SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder
| Title | SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder |
|---|---|
| Authors | |
| Issue Date | 2025 |
| Publisher | The University of Hong Kong (Pokfulam, Hong Kong) |
| Citation | Chung, H. M. [鍾凱文]. (2025). SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. |
| Abstract | Recently, single-cell lineage tracing experiments using random passenger DNA mutations as genetic markers become popular, because they are non-invasive and can be applied to many biological samples without biochemical intervention. However, SNV clone assignment and lineage inference in this type of data remain a challenge due to hierarchical mutation structure and many missing signals. Existing SNV analysis methods were mainly designed for simpler mutation patterns with low covariance in different SNPs, so they do not work well in lineage tracing data with high covariance in different SNPs due to hierarchical evolutionary history. Applying them to lineage tracing data easily results in abnormal convergence and inaccurate conclusions. To solve this problem, I developed SNPmanifold, a Python package that learns an embedding manifold with a better cell-cell distance metric using a shallow binomial variational autoencoder. Based on this better cell-cell distance metric, SNPmanifold can avoid numerical issues brought by high covariance in different SNPs and assign SNV clones and lineages more accurately based on neighboring cells. Also, visualization of hierarchical mutation structure becomes clear under UMAP with this better cell-cell distance metric. In this thesis, I demonstrated that SNPmanifold can effectively identify a large number of multiplexed donors of origin (k = 18) with nuclear SNPs that all existing unsupervised methods fail, and lineages of mitochondrial somatic clones with meaningful biological interpretations such as agreements with cell types, CNV clones, and TRB clonotypes. In short, SNPmanifold is a better SNV analysis method for high-covariance single-cell lineage tracing data that I developed in this thesis. |
| Degree | Master of Philosophy |
| Subject | Cytology Cells - Analysis Machine learning Bioinformatics |
| Dept/Program | Biomedical Sciences |
| Persistent Identifier | http://hdl.handle.net/10722/364024 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Chung, Hoi Man | - |
| dc.contributor.author | 鍾凱文 | - |
| dc.date.accessioned | 2025-10-20T02:56:36Z | - |
| dc.date.available | 2025-10-20T02:56:36Z | - |
| dc.date.issued | 2025 | - |
| dc.identifier.citation | Chung, H. M. [鍾凱文]. (2025). SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR. | - |
| dc.identifier.uri | http://hdl.handle.net/10722/364024 | - |
| dc.description.abstract | Recently, single-cell lineage tracing experiments using random passenger DNA mutations as genetic markers become popular, because they are non-invasive and can be applied to many biological samples without biochemical intervention. However, SNV clone assignment and lineage inference in this type of data remain a challenge due to hierarchical mutation structure and many missing signals. Existing SNV analysis methods were mainly designed for simpler mutation patterns with low covariance in different SNPs, so they do not work well in lineage tracing data with high covariance in different SNPs due to hierarchical evolutionary history. Applying them to lineage tracing data easily results in abnormal convergence and inaccurate conclusions. To solve this problem, I developed SNPmanifold, a Python package that learns an embedding manifold with a better cell-cell distance metric using a shallow binomial variational autoencoder. Based on this better cell-cell distance metric, SNPmanifold can avoid numerical issues brought by high covariance in different SNPs and assign SNV clones and lineages more accurately based on neighboring cells. Also, visualization of hierarchical mutation structure becomes clear under UMAP with this better cell-cell distance metric. In this thesis, I demonstrated that SNPmanifold can effectively identify a large number of multiplexed donors of origin (k = 18) with nuclear SNPs that all existing unsupervised methods fail, and lineages of mitochondrial somatic clones with meaningful biological interpretations such as agreements with cell types, CNV clones, and TRB clonotypes. In short, SNPmanifold is a better SNV analysis method for high-covariance single-cell lineage tracing data that I developed in this thesis. | en |
| dc.language | eng | - |
| dc.publisher | The University of Hong Kong (Pokfulam, Hong Kong) | - |
| dc.relation.ispartof | HKU Theses Online (HKUTO) | - |
| dc.rights | The author retains all proprietary rights, (such as patent rights) and the right to use in future works. | - |
| dc.rights | This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. | - |
| dc.subject.lcsh | Cytology | - |
| dc.subject.lcsh | Cells - Analysis | - |
| dc.subject.lcsh | Machine learning | - |
| dc.subject.lcsh | Bioinformatics | - |
| dc.title | SNPmanifold : detecting single-cell SNV clonality and lineages using binomial variational autoencoder | - |
| dc.type | PG_Thesis | - |
| dc.description.thesisname | Master of Philosophy | - |
| dc.description.thesislevel | Master | - |
| dc.description.thesisdiscipline | Biomedical Sciences | - |
| dc.description.nature | published_or_final_version | - |
| dc.date.hkucongregation | 2025 | - |
| dc.identifier.mmsid | 991045117393403414 | - |
