Generalization in autonomous driving systems

Kou, Weibin; 寇卫斌

File Download

FullText.pdf

Supplementary

Citations:
Appears in Collections:
- HKU Theses Online
- Electrical & Electronic Engineering: Theses

postgraduate thesis: Generalization in autonomous driving systems

Title	Generalization in autonomous driving systems
Authors	Kou, Weibin 寇卫斌
Advisors	Advisor(s):Wu, YC
Issue Date	2025
Publisher	The University of Hong Kong (Pokfulam, Hong Kong)
Citation	Kou, W. [寇卫斌]. (2025). Generalization in autonomous driving systems. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.
Abstract	Deep Learning (DL)-based autonomous driving (AD) develops rapidly. However, AD models generally face poor generalization. Specifically, on the one hand, AD models trained from data in a particular geographical region perform poorly when applied in other regions due to data domain-shift; on the other hand, AD models generally underperform in adverse weather conditions owing to invisibility or lack of structural information. To tackle the geographically related generalization problem, Hierarchical Federated Learning (HFL) offers a potential solution by collaborative privacy-preserving training over distributed datasets from different regions. Unfortunately, existing HFL suffers from three major challenges: (I) slow convergence because data from different cities are with disparate statistical properties; (II) inefficient use of communication resource owing to HFL's traditional static resource scheduling policy; (III) disregard of limited communication resource in practice. This thesis will introduce strategies to solve these challenges. In addition, to improve the AD model generalization in diverse weather conditions, a generalizable scheme without relying on clear reference images is proposed. To solve the HFL challenges, this thesis firstly proposes FedGau to accelerate convergence. FedGau models the statistical distribution of both single RGB image and RGB datasets as Gaussian distributions, allowing for aggregation weights to be designed based on data statistical properties rather than purely based on volume. Alongside FedGau, this thesis introduces AdapRS, which dynamically adjusts the number of model exchanges based on real-time performance metrics of the global model. Comprehensive experiments show that FedGau accelerates the HFL model convergence and AdapRS significantly reduces unnecessary communication overhead. On the other hand, to address the communication resource constraint in HFL, this thesis proposes CRCHFL framework. CRCHFL prioritizes different stages of the HFL process to minimize generalization errors. Extensive experiments demonstrate CRCHFL outperforms existing FL methods under strict communication resource constraints. After above HFL challenges are tackled, AD models can enhance generalization via training on large-scale and diverse data from various regions. As time progresses, the data involved in the federated AD system expands. Such ever-expanding data may eventually result in the model under-fitting, leading to poor generalization. To handle this under-fitting limitation, this thesis explores the potential of Large Vision Models (LVMs) to rectify the problem. Precisely, this thesis proposes a pFedLVM framework, which incorporates LVMs into federated AD systems, and exchanges latent features instead of entire LVM parameters between the central server and participating vehicles. This significantly reduces communication demands while allowing for personalized model updates tailored to local driving conditions. Finally, this thesis improves AD model generalization in adverse weather conditions through a novel strategy termed AdvImmu. This approach, which integrates techniques such as Locally Sequential Mechanism (LSM), Globally Shuffled Mechanism (GSM), and Unfolded Regularizers (URs), enhances model robustness across various weather conditions without relying on clear weather references. Additionally, to address the challenge of lacking frame-wise annotations in adverse conditions, AdvImmu incorporates the foundation model SAM with a clustering algorithm SBICAC to generate pseudo-labels, thereby enabling effective training in complex scenarios.
Degree	Doctor of Philosophy
Subject	Automated vehicles
Dept/Program	Electrical and Electronic Engineering
Persistent Identifier	http://hdl.handle.net/10722/356604

DC Field	Value	Language
dc.contributor.advisor	Wu, YC	-
dc.contributor.author	Kou, Weibin	-
dc.contributor.author	寇卫斌	-
dc.date.accessioned	2025-06-05T09:31:24Z	-
dc.date.available	2025-06-05T09:31:24Z	-
dc.date.issued	2025	-
dc.identifier.citation	Kou, W. [寇卫斌]. (2025). Generalization in autonomous driving systems. (Thesis). University of Hong Kong, Pokfulam, Hong Kong SAR.	-
dc.identifier.uri	http://hdl.handle.net/10722/356604	-
dc.description.abstract	Deep Learning (DL)-based autonomous driving (AD) develops rapidly. However, AD models generally face poor generalization. Specifically, on the one hand, AD models trained from data in a particular geographical region perform poorly when applied in other regions due to data domain-shift; on the other hand, AD models generally underperform in adverse weather conditions owing to invisibility or lack of structural information. To tackle the geographically related generalization problem, Hierarchical Federated Learning (HFL) offers a potential solution by collaborative privacy-preserving training over distributed datasets from different regions. Unfortunately, existing HFL suffers from three major challenges: (I) slow convergence because data from different cities are with disparate statistical properties; (II) inefficient use of communication resource owing to HFL's traditional static resource scheduling policy; (III) disregard of limited communication resource in practice. This thesis will introduce strategies to solve these challenges. In addition, to improve the AD model generalization in diverse weather conditions, a generalizable scheme without relying on clear reference images is proposed. To solve the HFL challenges, this thesis firstly proposes FedGau to accelerate convergence. FedGau models the statistical distribution of both single RGB image and RGB datasets as Gaussian distributions, allowing for aggregation weights to be designed based on data statistical properties rather than purely based on volume. Alongside FedGau, this thesis introduces AdapRS, which dynamically adjusts the number of model exchanges based on real-time performance metrics of the global model. Comprehensive experiments show that FedGau accelerates the HFL model convergence and AdapRS significantly reduces unnecessary communication overhead. On the other hand, to address the communication resource constraint in HFL, this thesis proposes CRCHFL framework. CRCHFL prioritizes different stages of the HFL process to minimize generalization errors. Extensive experiments demonstrate CRCHFL outperforms existing FL methods under strict communication resource constraints. After above HFL challenges are tackled, AD models can enhance generalization via training on large-scale and diverse data from various regions. As time progresses, the data involved in the federated AD system expands. Such ever-expanding data may eventually result in the model under-fitting, leading to poor generalization. To handle this under-fitting limitation, this thesis explores the potential of Large Vision Models (LVMs) to rectify the problem. Precisely, this thesis proposes a pFedLVM framework, which incorporates LVMs into federated AD systems, and exchanges latent features instead of entire LVM parameters between the central server and participating vehicles. This significantly reduces communication demands while allowing for personalized model updates tailored to local driving conditions. Finally, this thesis improves AD model generalization in adverse weather conditions through a novel strategy termed AdvImmu. This approach, which integrates techniques such as Locally Sequential Mechanism (LSM), Globally Shuffled Mechanism (GSM), and Unfolded Regularizers (URs), enhances model robustness across various weather conditions without relying on clear weather references. Additionally, to address the challenge of lacking frame-wise annotations in adverse conditions, AdvImmu incorporates the foundation model SAM with a clustering algorithm SBICAC to generate pseudo-labels, thereby enabling effective training in complex scenarios.	-
dc.language	eng	-
dc.publisher	The University of Hong Kong (Pokfulam, Hong Kong)	-
dc.relation.ispartof	HKU Theses Online (HKUTO)	-
dc.rights	The author retains all proprietary rights, (such as patent rights) and the right to use in future works.	-
dc.rights	This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.	-
dc.subject.lcsh	Automated vehicles	-
dc.title	Generalization in autonomous driving systems	-
dc.type	PG_Thesis	-
dc.description.thesisname	Doctor of Philosophy	-
dc.description.thesislevel	Doctoral	-
dc.description.thesisdiscipline	Electrical and Electronic Engineering	-
dc.description.nature	published_or_final_version	-
dc.date.hkucongregation	2025	-
dc.identifier.mmsid	991044970873203414	-

File Download

Supplementary

postgraduate thesis: Generalization in autonomous driving systems

Export via OAI-PMH Interface in XML Formats

OR

Export to Other Non-XML Formats