Document Type
Thesis
Degree Name
Master of Applied Computing
Department
Physics and Computer Science
Program Name/Specialization
Applied Computing
Faculty/School
Faculty of Science
First Advisor
Dr. Sukhjit Singh Sehra
Advisor Role
Thesis Supervision
Second Advisor
Dr. Dariush Ebrahimi
Advisor Role
Thesis Co-supervision
Abstract
Intelligent transportation systems (ITS) depend on accurate traffic prediction to support congestion management, infrastructure planning, and real-time operational decisions. Despite substantial progress in data-driven forecasting, several challenges continue to limit practical deployment: traffic data is distributed across independent regional authorities, making centralized aggregation infeasible, standard federated aggregation strategies ignore traffic-specific characteristics that meaningfully affect model quality, and existing models produce only numerical outputs without interpretable reasoning that urban planners can act upon. This thesis addresses these challenges through four contributions that collectively advance privacy-preserving, explainable, and scalable traffic forecasting.
The first contribution provides a systematic review of 129 peer-reviewed publications, establishing the methodological landscape of large language model (LLM) applications in ITS, categorizing integration strategies and connecting architectural choices to empirical performance. Domain-adapted models consistently outperform prompt-only approaches, while interpretability and privacy-preserving deployment emerge as the two most consequential open challenges. These findings directly motivated the subsequent contributions. Building on this foundation, the second contribution proposed a traffic-aware federated learning (FL) framework that replaces standard weighting averaging with a dynamic client scoring mechanism driven by six domain-specific metrics. When combined with traffic-informed client clustering and an attention-augmented architecture, the framework predicts both traffic flow and speed across multiple future horizons, demonstrating consistent improvements over standard federated and centralized baselines under heterogeneous non-IID conditions.
The third contribution designs a domain-adapted LLM fine-tuned on structured traffic prompts encoding sensor metadata, spatial neighbour dynamics, demand statistics, and temporal context, producing both numerical predictions and natural language explanations. The fourth contribution integrates domain-adapted LLM with federated training into a unified federated LLM (FedLLM) framework. This is, to the best of our knowledge, the first of its kind implementation for freeway traffic flow prediction. Independent clients train locally and exchange only lightweight adapter parameters, substantially reducing communication overhead while keeping raw data private. The framework outperforms several centralized and federated baselines and generalizes to unseen geographic regions without retraining.
Collectively, these contributions demonstrate that privacy-preserving decentralized training, human-interpretable predictions, and cross-region generalization are simultaneously achievable within a single deployable framework, offering transportation agencies, city planners, and mobility researchers a practical foundation for next-generation traffic management systems that respect data ownership, adapt to local conditions, and explain their outputs.
Recommended Citation
Kaur, Seerat, "A Novel Federated LLM Framework for Distributed Traffic Modelling in Intelligent Transportation Systems" (2026). Theses and Dissertations (Comprehensive). 2902.
https://scholars.wlu.ca/etd/2902
Convocation Year
2026
Convocation Season
Spring
Included in
Architectural Technology Commons, Artificial Intelligence and Robotics Commons, Computational Engineering Commons, Computer and Systems Architecture Commons, Data Science Commons, Systems Architecture Commons, Transportation Engineering Commons, Urban, Community and Regional Planning Commons