Author: Teerapong Panboonyuen (also known as Kao Panboonyuen)
Project: MeViT: A Medium-Resolution Vision Transformer
Publication: MeViT: A Medium-Resolution Vision Transformer for Semantic Segmentation on Landsat Satellite Imagery for Agriculture in Thailand
Semantic segmentation is a crucial task in remote sensing, focused on classifying every pixel within an image for various land use and land cover (LULC) applications. This project introduces MeViT (Medium-Resolution Vision Transformer), a novel approach tailored for Landsat satellite imagery of key economic crops in Thailand, including para rubber, corn, and pineapple. Our method enhances Vision Transformers (ViTs) by integrating medium-resolution, multi-branch architectures, optimized for semantic segmentation. The revised MixCFN (Mixed-Scale Convolutional Feedforward Networks) block within MeViT incorporates multiple depth-wise convolution paths, effectively balancing performance and efficiency.
Through extensive experimentation on Thailand’s satellite scenes, MeViT has demonstrated superior performance over state-of-the-art deep learning methods, achieving a precision of 92.22%, recall of 94.69%, F1 score of 93.44%, and mean IoU of 83.63%.
Clone the repository and install the required dependencies:
git clone https://github.com/kaopanboonyuen/MeViT.git
cd MeViT
pip install -r requirements.txt
config.yaml
file with your specific dataset paths and parameters. python train.py
python evaluate.py
python inference.py
For more details, visit the project website.
If you use this project in your research, please cite our work:
@article{panboonyuen2023mevit,
title={MeViT: A Medium-Resolution Vision Transformer for Semantic Segmentation on Landsat Satellite Imagery for Agriculture in Thailand},
author={Panboonyuen, Teerapong and Charoenphon, Chaiyut and Satirapod, Chalermchon},
journal={Remote Sensing},
volume={15},
number={21},
pages={5124},
year={2023},
publisher={MDPI}
}
This project is licensed under the MIT License. See the LICENSE file for details.
This work is based on research presented at a conference. Special thanks to our collaborators and contributors who supported the development of MeViT.
For any questions or contributions, feel free to open an issue or submit a pull request. We appreciate your interest in our work!