Analysis and inference for networks in bio-informatics and other applications

Wei, Xiguang; 魏锡光

File Download

FullText.pdf

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: Analysis and inference for networks in bio-informatics and other applications

Title	Analysis and inference for networks in bio-informatics and other applications
Authors	Wei, Xiguang 魏锡光
Advisors	Advisor(s):Chan, SC
Issue Date	2016
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Wei, X. [魏锡光]. (2016). Analysis and inference for networks in bio-informatics and other applications. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Networks are useful models in science and engineering with many applications such as bio-informatics. An important problem is network analysis where one aims to estimate the relations among the quantities from measurements, from which further inference can be carried out. For example, by analyzing the gene interactions from gene microarray or other data may help to improve our understanding of biological processes and more importantly underlying mechanism leading to various diseases. Images or videos can also be viewed as a network with strong local and inter frame correlation. With the advancement of modern information collection techniques, a large amount of quantitative measurements can be collected, which present great challenges to network analysis and inference due to the significantly increased numbers of variables for describing the relationships. Consequently, the associated complexity and dimensionality grow dramatically. Moreover, in group-based or longitudinal-based studies, multiple networks and dynamic networks are encountered, which further complicate the network analysis problem. Validation is also a critical issue in gene microarray data analysis due to the absence of ground truth. This thesis addresses these challenges by proposing several novel extensions to existing network analysis, inference techniques and focuses on practical applications in bio-informatics and target tracking in ultrasound videos. First, a new inter-network analysis method is proposed for identifying the difference in the network connections among different classes of data or different time points in time-series data. The proposed approach first identifies a set of “class specific features” and then the connections between these features and other remaining features. In particular, we propose a novel consensus gene selection criterion for partial least square (PLS) regression, which is able to identify more comprehensive but essential features by randomly partitioning of the training data of assessing their relative importance through the rank statistic. Through regressing these genes with the remaining genes for each class, “class specific subnetwork analysis” is performed. As the entire network may contain thousands of genes and millions of connections, an efficient parallel implementation based on the graphic processing unit (GPU) is also proposed to reduce the computational time. Second, a systematic method is proposed for analyzing large-scale time-course gene microarray data. The proposed approach first identifies putative genes and their interactions from time-course gene microarray data. Then, these identified genes and interactions are used for discovering new findings. In particular, we proposed a novel enrichment method, which is able to validate the statistical significance of the putative interactions by comparing these interactions with existing knowledge of interactions summarized in related databases. In this way, although no ground truth is available, the statistical validation results can still quantify the statistical significance of identified networks. Finally, a new target tracking algorithm for ultrasound videos is proposed. In particular, a novel position and intensity transformation is proposed to establish the correspondence between pixels in two frames. The usefulness of this proposed is verified by tracking Central Tendon of Rectus Femoris (CT-RF) in ultrasound videos. Experimental results on 20 subjects show that the proposed algorithm can better adapt to the intensity and shape variation in the image sequences and offers better tracking performance than conventional methods.
Degree	Doctor of Philosophy
Subject	Network analysis (Planning) Bioinformatics
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/282309

DC Field	Value	Language
dc.contributor.advisor	Chan, SC	-
dc.contributor.author	Wei, Xiguang	-
dc.contributor.author	魏锡光	-
dc.date.accessioned	2020-05-07T07:17:19Z	-
dc.date.available	2020-05-07T07:17:19Z	-
dc.date.issued	2016	-
dc.identifier.citation	Wei, X. [魏锡光]. (2016). Analysis and inference for networks in bio-informatics and other applications. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/282309	-
dc.description.abstract	Networks are useful models in science and engineering with many applications such as bio-informatics. An important problem is network analysis where one aims to estimate the relations among the quantities from measurements, from which further inference can be carried out. For example, by analyzing the gene interactions from gene microarray or other data may help to improve our understanding of biological processes and more importantly underlying mechanism leading to various diseases. Images or videos can also be viewed as a network with strong local and inter frame correlation. With the advancement of modern information collection techniques, a large amount of quantitative measurements can be collected, which present great challenges to network analysis and inference due to the significantly increased numbers of variables for describing the relationships. Consequently, the associated complexity and dimensionality grow dramatically. Moreover, in group-based or longitudinal-based studies, multiple networks and dynamic networks are encountered, which further complicate the network analysis problem. Validation is also a critical issue in gene microarray data analysis due to the absence of ground truth. This thesis addresses these challenges by proposing several novel extensions to existing network analysis, inference techniques and focuses on practical applications in bio-informatics and target tracking in ultrasound videos. First, a new inter-network analysis method is proposed for identifying the difference in the network connections among different classes of data or different time points in time-series data. The proposed approach first identifies a set of “class specific features” and then the connections between these features and other remaining features. In particular, we propose a novel consensus gene selection criterion for partial least square (PLS) regression, which is able to identify more comprehensive but essential features by randomly partitioning of the training data of assessing their relative importance through the rank statistic. Through regressing these genes with the remaining genes for each class, “class specific subnetwork analysis” is performed. As the entire network may contain thousands of genes and millions of connections, an efficient parallel implementation based on the graphic processing unit (GPU) is also proposed to reduce the computational time. Second, a systematic method is proposed for analyzing large-scale time-course gene microarray data. The proposed approach first identifies putative genes and their interactions from time-course gene microarray data. Then, these identified genes and interactions are used for discovering new findings. In particular, we proposed a novel enrichment method, which is able to validate the statistical significance of the putative interactions by comparing these interactions with existing knowledge of interactions summarized in related databases. In this way, although no ground truth is available, the statistical validation results can still quantify the statistical significance of identified networks. Finally, a new target tracking algorithm for ultrasound videos is proposed. In particular, a novel position and intensity transformation is proposed to establish the correspondence between pixels in two frames. The usefulness of this proposed is verified by tracking Central Tendon of Rectus Femoris (CT-RF) in ultrasound videos. Experimental results on 20 subjects show that the proposed algorithm can better adapt to the intensity and shape variation in the image sequences and offers better tracking performance than conventional methods.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Network analysis (Planning)	-
dc.subject.lcsh	Bioinformatics	-
dc.title	Analysis and inference for networks in bio-informatics and other applications	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2017	-
dc.identifier.mmsid	991044229568803414	-

File Download

Supplementary

postgraduate thesis: Analysis and inference for networks in bio-informatics and other applications

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats