Young Scientists Quickfire Pitch

Abstract

In this quick pitch, I’m thrilled to introduce MeViT—a medium-resolution Vision Transformer developed for high-precision semantic segmentation of Landsat satellite imagery. Designed specifically to handle Thailand’s key agricultural regions, MeViT allows us to classify different land use and land cover types, focusing on economically important crops like para rubber, corn, and pineapple. This application is essential for understanding crop distribution and monitoring agricultural trends at scale. MeViT is built to enhance standard Vision Transformers, integrating multi-scale depth-wise convolutions for better feature capture. This unique architecture provides MeViT with the ability to handle both local and global information, which is critical in distinguishing between various vegetation types in medium-resolution satellite images. By revising the mixed-scale convolutional feedforward network (MixCFN), MeViT balances accuracy with efficiency, achieving highly detailed results while remaining computationally efficient. Through extensive experiments on publicly available Thai Landsat data, MeViT has outperformed leading models in the field, showing superior performance across key metrics like precision, recall, F1 score, and mean IoU. This research highlights MeViT’s potential for scalable, accurate land-cover mapping in Southeast Asia, promising valuable insights for the agriculture sector. Check out my full work here, and let’s drive innovation together!

Date
2024 6:30 PM
Event
Young Scientists Quickfire Pitch
Location
National University of Singapore, Singapore
Teerapong Panboonyuen
Teerapong Panboonyuen

My research focuses on leveraging advanced machine intelligence techniques, specifically computer vision, to enhance semantic understanding, learning representations, visual recognition, and geospatial data interpretation.