Automating Topology Corrections for Fragmented Network
Project Overview
Many real-world networks, from transportation to utility grids, often suffer from fragmentation. Fragmented networks contain disconnected subgraphs, missing links, and irregular topological structures, making them inefficient for routing, analysis, or simulation. While vehicular road networks have received extensive efforts in ensuring topological coherence, networks like pedestrian infrastructure, bike lanes, and underground utility networks remain incomplete, requiring manual interventions that are costly and unscalable.
This research aims to develop an automated system for correcting fragmented network structures by integrating graph-based diffusion models, large language model (LLM) reasoning, and multi-modal data validation. Using pedestrian networks as a case study, we demonstrate how our approach generalizes to broader network correction tasks.
Figure 1. Left: Raw polygon centerline results from a semantic segmentation model. Right: Manually corrected network of the same area, prioritizing connectivity, routability, and simplicity.
Background and Motivation
Transportation networks, especially those designed for pedestrians and cyclists, often exist in fragmented states due to incomplete mapping, detection errors, or actual infrastructure gaps. Current semantic segmentation models can achieve up to 89% mean Intersection over Union (IoU) in identifying sidewalks, footpaths, and crossings from aerial imagery. However, converting these detected features into coherent, routable networks presents significant challenges.
The U.S. Department of Transportation (DOT) has recently launched initiatives to explore scalable methods for generating and refining such networks, highlighting the importance of this research area. The primary challenges include:
- Fragmented networks with missing links
- Disconnected nodes and subgraphs
- Irregular geometries that don’t follow logical paths
- Topological inconsistencies preventing effective routing
Research Gap and Innovation
This research addresses several critical gaps in the current approaches to network reconstruction:
Adapting Diffusion Models for Network Reconstruction
While diffusion models show promise in trajectory recovery, their direct application to fragmented network reconstruction remains unexplored. Challenges include:
- Lack of spatial and geographic constraints
- Absence of city-normalization priors
- Risk of generating implausible or unsafe connections
Our approach adapts diffusion-based state propagation for network graphs, introducing anisotropic constraints for alignment with real-world layouts.
LLM-Based Chain of Thought Methods
LLM-based Chain of Thought (CoT) reasoning provides a structured way to infer missing network connections by simulating urban planners’ decision-making processes. We are developing:
- Robust evaluation frameworks for LLM error detection and validation
- Multi-modal LLM models that integrate textual reasoning with spatial constraints
- Methods to maintain spatial consistency in LLM-generated network inferences
Local-Buffered Learning
We’re developing hierarchical learning models that:
- Capture fine pedestrian details (e.g., sidewalk crossings)
- Preserve broader urban alignment
- Balance fine-grained connectivity with large-scale network coherence
- Create scalable and adaptable reconstruction methodologies
Methodology
Our approach integrates several innovative techniques:
1. Preprocessing Network Data for LLMs
Since LLMs primarily process text-based inputs, we optimize data representation through:
- Human-readable adjacency lists that explicitly encode spatial relationships
- Structured text formats optimized for LLM processing
- Enhanced metadata encoding including sidewalk types, crosswalk locations, and accessibility constraints
2. Types of Network Fixes Categorization
To classify modifications and evaluate model performance, we define fix types based on differences between original and corrected networks:
- Unchanged: Segment remains identical in both datasets
- Modified: Segment altered while retaining >50% overlap with original geometry
- Split: Single segment divided into multiple segments covering >90% of original geometry
- Merge: Multiple segments combined into a single segment retaining all input geometries
- Extension: Segment lengthened at endpoints with ≤10% modification to middle geometry
- Deleted: Segment in original dataset with no corresponding geometry in corrected network
3. Evaluation Matrix
We assess model effectiveness using four key performance categories:
Performance Category | Explanation | Goal |
---|---|---|
Geometric Similarity | Measures alignment with manually corrected network | Ensuring correct spatial geometry |
Fix Action Accuracy | Evaluates implementation of necessary modifications | Measuring LLM compliance with user instructions |
Dead-End Analysis | Assesses resolution of isolated segments | Ensuring fully integrated and navigable networks |
Connectivity Improvement | Measures enhancement of overall network connectivity | Improving overall network usability |
4. Scaling to Larger Networks
Once validated on small-scale pedestrian networks, we expand testing to larger and more complex networks covering entire urban areas by:
- Handling increased data volume while maintaining efficient processing
- Addressing challenges like varying road densities and mixed-use pathways
- Ensuring LLM consistency across diverse pedestrian infrastructures
Data Sources
This project utilizes urban pedestrian network datasets provided by Professor Andres Sevtsuk from City Form Lab, DUSP. The dataset includes pedestrian networks from major U.S. cities such as New York City, Boston, and Los Angeles, containing road segments with attributes like sidewalk width, connectivity, and accessibility features.
Expected Outcomes
This research will deliver:
- Improved pedestrian network connectivity through automated correction of fragmented segments
- Optimized LLM processing through structured input representations and refined prompts
- Validation of LLM performance across different models, input formats, and reasoning approaches
- Reliable evaluation metrics to assess geometric accuracy, fix precision, dead-end reduction, and connectivity enhancement
- A scalable methodology applicable to broader network reconstruction tasks beyond pedestrian pathways
Applications and Future Work
The developed methodology has potential applications in:
- Urban Planning and Transportation: Creating complete, routable pedestrian and cycling networks
- Utility Network Management: Reconstructing fragmented underground utility networks
- Emergency Response: Improving evacuation route planning and accessibility mapping
- Smart City Development: Supporting comprehensive mobility planning across transportation modes
Future work will focus on:
- Extending the approach to other types of networks beyond transportation
- Integrating real-time data sources for dynamic network corrections
- Developing user interfaces for semi-automated network editing with LLM assistance