Semantic Segmentation on Remotely Sensed Images Using Deep Convolutional Encoder–Decoder Neural Networks
A research-driven deep learning framework pushing the boundaries of remote sensing scene understanding.
Semantic segmentation plays a crucial role in applications spanning agriculture, map updating, urban planning, and disaster response. Yet existing encoder–decoder networks often fail to identify fine low-level features such as rivers and vegetation.
FusionNetGeoLabel proposes an advanced framework specifically designed for aerial and satellite imagery, systematically addressing resolution loss, feature leakage, and limited domain-specific training data.
Five architectural innovations unified in a single framework.
Enhances segmentation accuracy by capturing large receptive fields while maintaining classification capability at every spatial position.
Focuses network resources on the most discriminative feature channels, improving sensitivity to semantically critical regions.
Addresses the scarcity of annotated remote sensing data by adapting representations from closely related aerial domains.
Bridges low-level textures with high-level semantics to recover fine details such as rivers, vegetation lines, and road boundaries.
Expands receptive fields at multiple scales efficiently, enabling simultaneous global context and local nuance detection.
Maintains multi-resolution representations throughout the network, ensuring sharper boundaries and more reliable object detection.
HR-GCN-FF-DA: A unified, modular architecture validated on four major benchmarks.
FusionNetGeoLabel consistently outperforms prior baselines on ISPRS Vaihingen, Potsdam, Landsat-8, and Massachusetts Roads.
| Method | IoU ↑ | F1-Score ↑ | OA ↑ | Dataset |
|---|---|---|---|---|
| U-Net Baseline | 72.4 | 82.7 | 85.3 | ISPRS Vaihingen |
| SegNet | 74.1 | 84.3 | 86.9 | ISPRS Vaihingen |
| GCN + Baseline | 78.5 | 87.2 | 88.4 | ISPRS Vaihingen |
| GCN + CA + Transfer | 82.3 | 89.5 | 90.6 | ISPRS Vaihingen |
| HR-GCN-FF-DA (Ours) | 86.8 | 92.1 | 93.4 | ISPRS Vaihingen |
Qualitative segmentation outputs on held-out test scenes.
Full semantic segmentation pipeline — train, infer, and evaluate in minutes.
git clone https://github.com/kaopanboonyuen/FusionNetGeoLabel.git cd FusionNetGeoLabel pip install -r requirements.txt
bash scripts/download_isprs.sh bash scripts/download_potsdam.sh bash scripts/download_massachusetts.sh
python train.py --config config.json
python inference.py \ --model checkpoints/hrgcn_ff_da.pth \ --image sample.png
python test.py \ --config config.json \ --model checkpoints/hrgcn_ff_da.pth
docker build -t fusionnetgeolabel . docker run --gpus all -it fusionnetgeolabel
Peer-reviewed research derived from this doctoral work.
Supported by prestigious royal and university scholarships.
Chulalongkorn University, 2018–2020
Thesis: Semantic Segmentation on Remotely Sensed Images Using Deep Convolutional Encoder-Decoder Neural Network
Chulalongkorn University, 2016–2017
Thesis: High-Resolution Road Extraction Using Deep Convolutional Neural Networks and Conditional Random Fields
Research powering intelligent geospatial systems at scale.
Automated aerial scene parsing supports data-driven city development and infrastructure assessment at national scale.
Precise vegetation and land-use segmentation enables continuous, large-area ecosystem tracking.
Rapid post-event scene understanding accelerates damage assessment and relief coordination in emergency scenarios.
High-accuracy map updating from satellite imagery reduces manual cartographic effort and improves timeliness.
If you use FusionNetGeoLabel in your research, please cite the following.
@phdthesis{panboonyuen2019semantic,
title = {Semantic segmentation on remotely sensed images using deep
convolutional encoder-decoder neural network},
author = {Teerapong Panboonyuen},
year = {2019},
school = {Chulalongkorn University},
type = {Ph.D. thesis},
doi = {10.58837/CHULA.THE.2019.158},
address = {Faculty of Engineering},
note = {Doctor of Philosophy}
}
MIT License · opensource.org/licenses/MIT