Document Type

Thesis

Degree Name

Master of Applied Computing

Department

Physics and Computer Science

Program Name/Specialization

Applied Computing

Faculty/School

Faculty of Science

First Advisor

Emad Mohammed

Advisor Role

Supervisor

Second Advisor

Saiqa Aleem

Advisor Role

Co-Supervisor

Abstract

Deploying deep learning models for medical image analysis on mobile devices requires a balance between inference latency, memory footprint, and delineating anatomical boundaries with high accuracy. While Convolutional Neural Networks (CNNs) and mobile Vision Transformers (ViTs) offer efficiency, they often struggle to model the irregular, non-local geometric structures inherent in biological tissues without incurring prohibitive computational costs. In this thesis, we introduce GeoViG (Geometric Vision Graph), an architecture that bridges the gap between efficient grid-based processing and explicit Geometric Deep Learning. GeoViG introduces a novel transition from high-resolution pixel grids to low-resolution dynamic graphs via a SpreadEdgePool operator, a geometry-aware downsampling mechanism. This operator aggregates features based on diffusion distance rather than fixed spatial strides, effectively preserving fine-grained structural diversity while reducing dimensionality. Experimental results show that GeoViG achieves a Top-1 accuracy of up to 82.38% on ImageNet-1K and competitive mean average precision (mAP) on MS COCO, utilizing 30% fewer parameters with up to a 2.35× speedup on mobile GPUs (iPhone 13 GPU). Crucially, for medical segmentation tasks on Kvasir-SEG and DSB 2018, GeoViG outperforms significantly larger models in boundary adherence. GeoViG achieves a Dice Score of 0.945 (vs. 0.875 for ResNet50) while reducing the Hausdorff Distance by over 5× (from 70.37 to 12.94). GeoViG eliminates outlier artifacts and captures fine-grained irregular anatomical structures, making it suitable for portable medical diagnostics. This work also explores the foundation for PureViG architectures, aiming for fully graph-based visual reasoning.

Recommended Citation

Ismail, Omar, "GeoViG and PureViG: geometry-aware architectures for efficient computer vision" (2026). Theses and Dissertations (Comprehensive). 2891.
https://scholars.wlu.ca/etd/2891

Convocation Year

2026

Convocation Season

Spring

Download

Included in

Algebraic Geometry Commons, Artificial Intelligence and Robotics Commons, Geometry and Topology Commons, Graphics and Human Computer Interfaces Commons, Numerical Analysis and Scientific Computing Commons, Other Medicine and Health Sciences Commons, Software Engineering Commons, Systems Architecture Commons, Theory and Algorithms Commons

COinS

Theses and Dissertations (Comprehensive)

GeoViG and PureViG: geometry-aware architectures for efficient computer vision

Document Type

Degree Name

Department

Program Name/Specialization

Faculty/School

First Advisor

Advisor Role

Second Advisor

Advisor Role

Abstract

Recommended Citation

Convocation Year

Convocation Season

Included in

Search

Browse

Author Corner

Links

Theses and Dissertations (Comprehensive)

GeoViG and PureViG: geometry-aware architectures for efficient computer vision

Author

Document Type

Degree Name

Department

Program Name/Specialization

Faculty/School

First Advisor

Advisor Role

Second Advisor

Advisor Role

Abstract

Recommended Citation

Convocation Year

Convocation Season

Included in

Share

Search

Browse

Author Corner

Links