Automating Topology Corrections for Fragmented Networks

March 12, 2025

Project Overview

Many real-world networks, from transportation to utility grids, suffer from fragmentation. Fragmented networks contain disconnected subgraphs, missing links, and irregular topological structures, which makes them ill-suited to routing, analysis, or simulation. Vehicular road networks have received extensive effort to ensure topological coherence, but pedestrian infrastructure, bike lanes, and underground utility networks remain incomplete and still depend on manual intervention that is costly and does not scale.

This research aims to develop an automated system for correcting fragmented network structures by integrating graph-based diffusion models, large language model (LLM) reasoning, and multi-modal data validation. Using pedestrian networks as a case study, we demonstrate how our approach generalizes to broader network correction tasks.

Network Fragmentation Example

Figure 1. Left: Raw polygon centerline results from a semantic segmentation model. Right: Manually corrected network of the same area, prioritizing connectivity, routability, and simplicity.

Background and Motivation

Transportation networks, especially those designed for pedestrians and cyclists, often exist in fragmented states due to incomplete mapping, detection errors, or actual infrastructure gaps. Current semantic segmentation models can achieve up to 89% mean Intersection over Union (IoU) in identifying sidewalks, footpaths, and crossings from aerial imagery. However, converting these detected features into coherent, routable networks presents significant challenges.

The U.S. Department of Transportation (DOT) has recently launched initiatives to explore scalable methods for generating and refining such networks, highlighting the importance of this research area. The primary challenges include:

Fragmented networks with missing links
Disconnected nodes and subgraphs
Irregular geometries that don’t follow logical paths
Topological inconsistencies preventing effective routing

Research Gap and Innovation

This research addresses several critical gaps in the current approaches to network reconstruction:

Adapting Diffusion Models for Network Reconstruction

While diffusion models show promise in trajectory recovery, their direct application to fragmented network reconstruction remains unexplored. Challenges include:

Lack of spatial and geographic constraints
Absence of city-normalization priors
Risk of generating implausible or unsafe connections

Our approach adapts diffusion-based state propagation for network graphs, introducing anisotropic constraints for alignment with real-world layouts.
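As a rough illustration of what "anisotropic" propagation on a graph can mean, the sketch below runs plain heat-style diffusion of a scalar node state, with edge weights that favor a preferred axis (for example, the dominant street direction). This is a simplified stand-in and not the diffusion model used in this project; the function names, the `cross_weight` parameter, and the cosine-squared weighting are all assumptions made for the example.

```python
from math import atan2, cos

def edge_weight(p, q, axis_angle, cross_weight=0.1):
    """Weight near 1 for edges parallel to the preferred axis, falling to
    cross_weight for perpendicular edges. The weighting rule is an assumption."""
    theta = atan2(q[1] - p[1], q[0] - p[0]) - axis_angle
    along = cos(theta) ** 2
    return along + cross_weight * (1.0 - along)

def diffuse(coords, edges, state, axis_angle=0.0, rate=0.2, steps=10):
    """Propagate a scalar node state along edges; total state is conserved."""
    state = dict(state)  # do not mutate the caller's dictionary
    weights = [(u, v, edge_weight(coords[u], coords[v], axis_angle))
               for u, v in edges]
    for _ in range(steps):
        delta = {n: 0.0 for n in state}
        for u, v, w in weights:
            flow = rate * w * (state[v] - state[u])
            delta[u] += flow
            delta[v] -= flow
        for n in state:
            state[n] += delta[n]
    return state

# Hypothetical toy layout: O is connected to E (east) and N (north).
coords = {"O": (0.0, 0.0), "E": (10.0, 0.0), "N": (0.0, 10.0)}
edges = [("O", "E"), ("O", "N")]
result = diffuse(coords, edges, {"O": 1.0, "E": 0.0, "N": 0.0}, steps=5)
print(result)
```

With the preferred axis pointing east, state spreads from O to E faster than to N, which is the kind of directional bias an anisotropic constraint is meant to introduce.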

LLM-Based Chain of Thought Methods

LLM-based Chain of Thought (CoT) reasoning provides a structured way to infer missing network connections by simulating urban planners’ decision-making processes. We are developing:

Robust evaluation frameworks for LLM error detection and validation
Multi-modal LLMs that integrate textual reasoning with spatial constraints
Methods to maintain spatial consistency in LLM-generated network inferences

Local-Buffered Learning

We’re developing hierarchical learning models that:

Capture fine pedestrian details (e.g., sidewalk crossings)
Preserve broader urban alignment
Balance fine-grained connectivity with large-scale network coherence
Create scalable and adaptable reconstruction methodologies

Methodology

Our approach proceeds in four steps:

1. Preprocessing Network Data for LLMs

Since LLMs primarily process text-based inputs, we optimize data representation through:

Human-readable adjacency lists that explicitly encode spatial relationships
Structured text formats optimized for LLM processing
Enhanced metadata encoding including sidewalk types, crosswalk locations, and accessibility constraints
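To make the idea of a human-readable adjacency list concrete, here is a minimal sketch of one possible text encoding. The node IDs, coordinates, metadata keys (`type`, `accessible`), and the output layout are all hypothetical; the actual format used in the project may differ.

```python
from math import atan2, degrees, hypot

# Hypothetical toy network: node id -> (x, y) in metres, plus undirected
# edges carrying illustrative metadata. None of these names are from the project.
NODES = {"A": (0.0, 0.0), "B": (30.0, 0.0), "C": (30.0, 40.0)}
EDGES = [
    ("A", "B", {"type": "sidewalk", "accessible": True}),
    ("B", "C", {"type": "crosswalk", "accessible": False}),
]

COMPASS = ["E", "NE", "N", "NW", "W", "SW", "S", "SE"]

def bearing(p, q):
    """Coarse compass direction from point p to point q (x east, y north)."""
    angle = degrees(atan2(q[1] - p[1], q[0] - p[0])) % 360
    return COMPASS[int((angle + 22.5) // 45) % 8]

def to_adjacency_text(nodes, edges):
    """Render one line per node, listing each neighbour with direction,
    length, and metadata so that spatial relations are spelled out."""
    neighbours = {n: [] for n in nodes}
    for u, v, meta in edges:
        neighbours[u].append((v, meta))
        neighbours[v].append((u, meta))
    lines = []
    for n in sorted(nodes):
        x, y = nodes[n]
        parts = []
        for m, meta in sorted(neighbours[n], key=lambda item: item[0]):
            dist = hypot(nodes[m][0] - x, nodes[m][1] - y)
            access = "accessible" if meta["accessible"] else "not accessible"
            direction = bearing(nodes[n], nodes[m])
            parts.append(f"{m} ({direction}, {dist:.0f} m, {meta['type']}, {access})")
        lines.append(f"{n} at ({x:.0f}, {y:.0f}) -> " + ("; ".join(parts) or "no neighbours"))
    return "\n".join(lines)

print(to_adjacency_text(NODES, EDGES))
```

Writing direction and distance into each line spares the model from inferring them from raw coordinates, at the cost of a longer prompt.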

2. Categorizing Types of Network Fixes

To classify modifications and evaluate model performance, we define fix types based on differences between original and corrected networks:

Unchanged: Segment remains identical in both datasets
Modified: Segment altered while retaining >50% overlap with original geometry
Split: Single segment divided into multiple segments covering >90% of original geometry
Merge: Multiple segments combined into a single segment retaining all input geometries
Extension: Segment lengthened at endpoints with ≤10% modification to middle geometry
Deleted: Segment in original dataset with no corresponding geometry in corrected network
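The categories above can be read as a rule-based classifier. The sketch below applies the stated thresholds (>50%, >90%, ≤10%) to precomputed overlap measures. The `SegmentMatch` fields, the order in which rules are checked, and the `Unclassified` fallback are assumptions for illustration; computing the overlap values themselves would need a geometry library and is not shown.

```python
from dataclasses import dataclass

@dataclass
class SegmentMatch:
    """Precomputed comparison of an original segment with the corrected
    network. All fields are hypothetical inputs, not the project's schema."""
    n_corrected: int          # corrected segments overlapping the original
    n_original: int           # original segments absorbed by the match
    overlap: float            # share of original geometry still covered (0-1)
    middle_change: float      # share of interior geometry that changed (0-1)
    endpoints_extended: bool  # corrected segment reaches past an endpoint
    identical: bool           # geometry is the same in both datasets

def classify_fix(m: SegmentMatch) -> str:
    """Map a match to a fix type; the rule order is an assumption."""
    if m.n_corrected == 0 or m.overlap == 0.0:
        return "Deleted"
    if m.identical:
        return "Unchanged"
    if m.n_original > 1 and m.n_corrected == 1:
        return "Merge"
    if m.n_corrected > 1 and m.overlap > 0.90:
        return "Split"
    if m.endpoints_extended and m.middle_change <= 0.10:
        return "Extension"
    if m.overlap > 0.50:
        return "Modified"
    return "Unclassified"  # fallback not defined in the text above

print(classify_fix(SegmentMatch(3, 1, 0.95, 0.0, False, False)))
```

Because some definitions overlap (a lengthened segment also retains more than 50% of its geometry), the order of the checks decides the label, so any real implementation would need to fix that order explicitly.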

3. Evaluation Matrix

We assess model effectiveness using four key performance categories:

Performance Category	Explanation	Goal
Geometric Similarity	Measures alignment with manually corrected network	Ensuring correct spatial geometry
Fix Action Accuracy	Evaluates implementation of necessary modifications	Measuring LLM compliance with user instructions
Dead-End Analysis	Assesses resolution of isolated segments	Ensuring fully integrated and navigable networks
Connectivity Improvement	Measures enhancement of overall network connectivity	Improving overall network usability
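Two of these categories, dead-end analysis and connectivity improvement, reduce to simple graph statistics. The sketch below counts degree-1 nodes and connected components before and after a correction. Treating these two counts as the metrics is an assumption made for the example, and the edge lists are invented toy data.

```python
from collections import defaultdict

def build_adjacency(edges):
    """Undirected adjacency sets from a list of (u, v) node pairs."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj

def count_dead_ends(edges):
    """Number of nodes with exactly one neighbour."""
    adj = build_adjacency(edges)
    return sum(1 for nbrs in adj.values() if len(nbrs) == 1)

def count_components(edges):
    """Number of connected subgraphs, found by depth-first search."""
    adj = build_adjacency(edges)
    seen, components = set(), 0
    for start in adj:
        if start in seen:
            continue
        components += 1
        seen.add(start)
        stack = [start]
        while stack:
            node = stack.pop()
            for nbr in adj[node]:
                if nbr not in seen:
                    seen.add(nbr)
                    stack.append(nbr)
    return components

# Toy data: two fragments, then the same network with two links added.
raw = [("A", "B"), ("C", "D"), ("D", "E")]
corrected = raw + [("B", "C"), ("E", "A")]

print(count_dead_ends(raw), count_components(raw))              # 4 2
print(count_dead_ends(corrected), count_components(corrected))  # 0 1
```

Some dead ends are legitimate (a path that really does stop), so a drop in the count is a proxy for improvement and should be read alongside the geometric similarity score.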

4. Scaling to Larger Networks

Once the approach is validated on small-scale pedestrian networks, we will expand testing to larger and more complex networks covering entire urban areas by:

Handling increased data volume while maintaining efficient processing
Addressing challenges like varying road densities and mixed-use pathways
Ensuring LLM consistency across diverse pedestrian infrastructures

Data Sources

This project uses urban pedestrian network datasets provided by Professor Andres Sevtsuk from City Form Lab, DUSP. The dataset covers pedestrian networks from major U.S. cities such as New York City, Boston, and Los Angeles, and contains road segments with attributes such as sidewalk width, connectivity, and accessibility features.

Expected Outcomes

This research will deliver:

Improved pedestrian network connectivity through automated correction of fragmented segments
Optimized LLM processing through structured input representations and refined prompts
Validation of LLM performance across different models, input formats, and reasoning approaches
Reliable evaluation metrics to assess geometric accuracy, fix precision, dead-end reduction, and connectivity enhancement
A scalable methodology applicable to broader network reconstruction tasks beyond pedestrian pathways

Applications and Future Work

The developed methodology has potential applications in:

Urban Planning and Transportation: Creating complete, routable pedestrian and cycling networks
Utility Network Management: Reconstructing fragmented underground utility networks
Emergency Response: Improving evacuation route planning and accessibility mapping
Smart City Development: Supporting comprehensive mobility planning across transportation modes

Future work will focus on:

Extending the approach to other types of networks beyond transportation
Integrating real-time data sources for dynamic network corrections
Developing user interfaces for semi-automated network editing with LLM assistance