Teerapong Panboonyuen

Senior AI Research Scientist, PostDoc Fellow

MARSAIL, Chula

My research focuses on Learning Representations—developing cutting-edge algorithms with optimization theory to push AI’s limits. I work with advanced models like GANs and Diffusion Models, leverage Self-Supervised Learning, explore how Adversarial Attacks on Large Language Models (LLMs) could reshape the future of AI.

I am currently a Senior Research Scientist at MARSAIL (Motor AI Recognition Solution Artificial Intelligence Laboratory) and a C2F High-Potential Postdoc at Chulalongkorn University. I received my Ph.D. in Computer Engineering from Chulalongkorn University, where I specialized in AI. On top of that, I’m the founder of MYRIDA.

Passionate about Cognitive Intelligence and unlocking human potential, I’m also deeply immersed in Geospatial Intelligence, where LLMs uncover groundbreaking insights that reshape how we understand and interact with our world.

Detailed summaries of my academic, industry, and teaching experience can be found in my CV or IEEE Biography, and get a glimpse into my personal life on my blog and tumblr. By the way, feel free to vibe to my music on SoundCloud.

Thai name: ธีรพงศ์ ปานบุญยืน, aka Kao Panboonyuen, or just Kao (เก้า).

Download my CV. Know Me in a Minute.

Interests

Applied Earth Observations
Geoscience and Remote Sensing
Computer Vision
Semantic Distillation
Human-AI Interaction
Learning Representations

Education

PostDoc Fellow in AI, 2026

Chulalongkorn University
PhD in Computer Engineering, 2020

Chulalongkorn University
MEng in Computer Engineering, 2017

Chulalongkorn University
BEng in Computer Engineering, 2015

KMUTNB (Top 1% in University Mathematics)
Pre-Engineering School (PET21), 2012

KMUTNB (Senior High School, 10th - 12th Grade)

Selected Awards

H.M. the King Bhumibhol Adulyadej’s 72nd Birthday Anniversary Scholarship (Master)
The 100th Anniversary Chulalongkorn University Fund for Doctoral Scholarship (Ph.D.)
The 90th Anniversary of Chulalongkorn University Scholarship (Ph.D.)
Postdoctoral Grant, Ratchadapisek Research Fund (RRF) (Chulalongkorn University) (Postdoc, 2021-2025)
Postdoctoral Research Grant, Second Century Fund (C2F) (Chulalongkorn University) (Postdoc, 2025-2026)
Top 1% Score in University Differential Calculus and Engineering Mathematics
2017 Best Student Paper Award in International Conference on Computing and Information Technology (IC2IT)
2019 Best Young Researcher Paper Award in First International Conference on Smart Technology & Urban Development (STUD)
2022 Bangkok Marathon 42.195K Finisher with successfully completed a full marathon run (42.195 kilometers) (Bangkok Marathon)
2024 IRONMAN 70.3 Finisher with successfully completed a challenging triathlon consisting of a 1.9K swim, 90K bike ride, and 21.1K run (IM70.3)
2024 Laguna Phuket Triathlon Finisher with successfully completed a challenging triathlon consisting of a 1.8K swim, 55K bike ride, and 12K run (LPT)
2024 Distinguished Reviewer for the Bronze Level of IEEE Transactions on Medical Imaging (Certificate)
2025 Chombueng Marathon 42.195K Finisher with successfully completed a full marathon run (42.195 kilometers) (Chombueng Marathon)
2025 Oral Presentation – Selected Among Top 12.5% Abstracts at the 14th Critical Care Conference, organized by the Thai Society of Critical Care Medicine (TSCCM)
2025 Global Young Scientists Summit (GYSS) Scholarship from Her Royal Highness Princess Maha Chakri Sirindhorn (GYSS)

Reviewer for International Journals/Conferences:

Invited Reviewer of Pattern Recognition (Elsevier)
Invited Reviewer of Neurocomputing (Elsevier)
Invited Reviewer of Transactions on Knowledge Discovery from Data (ACM)
Invited Reviewer of Computer Vision and Image Understanding (Elsevier)
Invited Reviewer of Computers and Geosciences (Elsevier)
Invited Reviewer of Neural Networks (Elsevier) (Certificate)
Invited Reviewer of Remote Sensing (MDPI)
Invited Reviewer of Artificial Intelligence Review (Nature Portfolio)
Invited Reviewer of Scientific Reports (Nature Portfolio) (Certificate)
Invited Reviewer of GIScience & Remote Sensing (Taylor & Francis)
Invited Reviewer of European Journal of Remote Sensing (Taylor & Francis)
Invited Reviewer of International Journal of Remote Sensing (Taylor & Francis)
Invited Reviewer of IEEE Transactions on Artificial Intelligence (IEEE)
Invited Reviewer of IEEE Transactions on Image Processing (IEEE)
Invited Reviewer of IEEE Transactions on Medical Imaging (IEEE) (Certificate)
Invited Reviewer of IEEE Transactions on Geoscience and Remote Sensing (IEEE)
Invited Reviewer of Pattern Analysis and Machine Intelligence (PAMI) (IEEE)
More reviews can be found under my WoS ID AAO-4985-2020.
More certificates of reviewers can be found at my GitHub Repository.

Additional Certifications in Research Ethics:

My certificate in GCP (Good Clinical Practice) – Ethics in Human Research is available in my GCP Certificate (English) and my GCP Certificate (Thai).

Selected Press

The Leader Asia: Dr. Teerapong and his team introduced their advanced AI for car damage detection at ICIAP 2023 in Udine, setting new accuracy standards with their innovative MARS model.
Techsauce: Highlighted their AI technology for automatic car damage assessment, earning recognition for excellence at ICIAP 2023 in Italy.
LINE TODAY: Showcased the MARS model at ICIAP 2023, noted for its high accuracy and setting new global standards in car damage detection.
Moneychat: Reported the award-winning innovation in AI for car damage estimation presented at ICIAP 2023.
Kaohoon: Celebrated the award-winning success of MARSAIL at ICIAP 2023.
Mitistock: Introduced the MARS model, featuring advanced self-attention mechanisms for vehicle damage assessment in Thailand.
The Story Thailand: Presented cutting-edge AI techniques in car wound detection, achieving high accuracy and setting international benchmarks.
Media of Thailand: Unveiled the MARS model at ICIAP 2023, recognized globally for its precision in car damage detection.
Thailand Insurance News: Featured Dr. Teerapong’s MARS model at ICIAP 2023 for its groundbreaking accuracy in car damage detection.
WealthPlusToday: Dr. Teerapong’s MARSAIL wowed ICIAP 2023 in Italy, clinching an excellence award for next-gen car damage detection.
Chulalongkorn University: Published a study on semantic road segmentation using deep convolutional neural networks.
Chula Engineering News: Featured Dr. Teerapong’s participation in the Global Young Scientists Summit (GYSS) 2025, highlighting academic leadership and global collaboration.
Thaivivat Insurance: Announced Dr. Teerapong’s research recognition at UAMC 2025, emphasizing advancements in AI for urban analytics and mobility challenges.

Featured Publications

Teerapong Panboonyuen, C. Charoenphon, C. Satirapod

2025 In European Journal of Remote Sensing (Taylor & Francis)

GuidedBox: A segmentation-guided box teacher-student approach for weakly supervised road segmentation

Road segmentation in remote sensing is crucial for applications like urban planning, traffic monitoring, and autonomous driving. Labeling objects via pixel-wise segmentation is challenging compared to bounding boxes. Existing weakly supervised segmentation methods often rely on heuristic bounding box priors, but we propose that box-supervised techniques can yield better results. Introducing GuidedBox, an end-to-end framework for weakly supervised instance segmentation. GuidedBox uses a teacher model to generate high-quality pseudo-masks and employs a confidence scoring mechanism to filter out noisy masks. We also introduce a noise-aware pixel loss and affinity loss to optimize the student model with pseudo-masks. Our extensive experiments show that GuidedBox outperforms state-of-the-art methods like SOLOv2, CondInst, and Mask R-CNN on the Massachusetts Roads Dataset, achieving an AP50 score of 0.9231. It also shows strong performance on SpaceNet and DeepGlobe datasets, proving its versatility in remote sensing applications. Code has been made available at https://github.com/kaopanboonyuen/GuidedBox.

Teerapong Panboonyuen

2025 In the 14th Critical Care Conference

CU-ICU: Customizing Unsupervised Instruction-Finetuned Language Models for ICU Datasets via Text-to-Text Transfer Transformer

Integrating large language models into specialized domains like healthcare presents unique challenges, including domain adaptation and limited labeled data. We introduce CU-ICU, a method for customizing unsupervised instruction-finetuned language models for ICU datasets by leveraging the Text-to-Text Transfer Transformer (T5) architecture. CU-ICU employs a sparse fine-tuning approach that combines few-shot prompting with selective parameter updates, enabling efficient adaptation with minimal supervision. Our evaluation across critical ICU tasks—early sepsis detection, mortality prediction, and clinical note generation—demonstrates that CU-ICU consistently improves predictive accuracy and interpretability over standard fine-tuning methods. Notably, CU-ICU achieves up to a 15% increase in sepsis detection accuracy and a 20% enhancement in generating clinically relevant explanations while updating fewer than 1% of model parameters in its most efficient configuration. These results establish CU-ICU as a scalable, low-overhead solution for delivering accurate and interpretable clinical decision support in real-world ICU environments.

Teerapong Panboonyuen, C. Charoenphon, C. Satirapod

2025 In IEEE Access

SatDiff: A Stable Diffusion Framework for Inpainting Very High-Resolution Satellite Imagery

Satellite image inpainting is a critical task in remote sensing, requiring accurate restoration of missing or occluded regions for reliable image analysis. In this paper, we present SatDiff, an advanced inpainting framework based on diffusion models, specifically designed to tackle the challenges posed by very high-resolution (VHR) satellite datasets such as DeepGlobe and the Massachusetts Roads Dataset. Building on insights from our previous work, SatInPaint, we enhance the approach to achieve even higher recall and overall performance. SatDiff introduces a novel Latent Space Conditioning technique that leverages a compact latent space for efficient and precise inpainting. Additionally, we integrate Explicit Propagation into the diffusion process, enabling forward-backward fusion for improved stability and accuracy. Inspired by encoder-decoder architectures like the Segment Anything Model (SAM), SatDiff is seamlessly adaptable to diverse satellite imagery scenarios. By balancing the efficiency of preconditioned models with the flexibility of postconditioned approaches, SatDiff establishes a new benchmark in VHR satellite datasets, offering a scalable and high-performance solution for satellite image restoration. The code for SatDiff is publicly available at https://github.com/kaopanboonyuen/SatDiff.

C. Charoenphon, Teerapong Panboonyuen, B. Zhang, C. Satirapod

2025 In Journal of Spatial Science (Taylor & Francis)

Investigating the use of deep learning-derived weighted mean temperature for GPS-PWVs estimation

GNSS data offers a reliable alternative for estimating Precipitable Water Vapor (PWV), but accurate GPS-PWV determination in tropical climates requires weighted mean temperature (Tm). With traditional measurement methods often unavailable in Thailand, and existing empirical models showing low accuracy, we propose a deep learning approach. Our Bidirectional Learning with Attention (BLA) model incorporates GRUs and an attention mechanism for Tm modeling. Trained on ERA5 data (2017-2021) and evaluated on 2022 data, BLA-Tm achieved 76% improvement over conventional models, reducing biases significantly. Validation with 280 GNSS stations confirmed BLA-Tm’s superior accuracy in GPS-PWV estimation.

Teerapong Panboonyuen

2025 In arXiv:2506.10524 [cs.CV]

ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation

This paper introduces ALBERT, an instance segmentation model designed specifically for comprehensive car damage and part segmentation. Leveraging the power of Bidirectional Encoder Representations, ALBERT incorporates advanced localization mechanisms to accurately identify and differentiate between real and fake damages as well as segment individual car parts. The model is trained on a large-scale, richly annotated automotive dataset, categorizing damage into 26 types, identifying 7 fake damage variants, and segmenting 61 distinct car parts. Our approach demonstrates strong performance in both segmentation accuracy and damage classification, paving the way for intelligent automotive inspection and assessment applications. This work not only contributes a powerful tool for automated vehicle inspection but also lays the groundwork for future research in intelligent automotive diagnostics, safety evaluation, and insurance claim automation, with significant implications for both industry and research communities.

Teerapong Panboonyuen

2025 In arXiv:2506.10528 [cs.CV]

SLICK: Selective Localization and Instance Calibration for Knowledge-Enhanced Car Damage Segmentation in Automotive Insurance

We propose SLICK, a novel and efficient framework for high-precision car damage segmentation, designed for real-world deployment in automotive insurance and inspection workflows. SLICK introduces five synergistic components, selective part segmentation guided by structural priors, localization-aware attention to highlight fine-grained damage, instance-sensitive refinement for precise boundary separation, cross-channel calibration to amplify subtle cues like scratches and dents, and a knowledge fusion module that integrates synthetic crash data, part geometry, and annotated insurance datasets. Trained using a teacher–student distillation strategy with ALBERT as the teacher, SLICK retains high segmentation fidelity while achieving up to 7× faster inference. Extensive experiments on large-scale automotive datasets demonstrate SLICK’s superior accuracy, generalization, and runtime efficiency—making it ideal for real-time, high-stakes applications in insurance automation and vehicle inspection.

Teerapong Panboonyuen

2024 In 17th International Conference on Knowledge and Smart Technology (KST2025)

SEA-ViT: Sea Surface Currents Forecasting Using Vision Transformer and GRU-Based Spatio-Temporal Covariance Modeling

Forecasting sea surface currents is essential for applications such as maritime navigation, environmental monitoring, and climate analysis, particularly in regions like the Gulf of Thailand and the Andaman Sea. This paper introduces SEA-ViT, an advanced deep learning model that integrates Vision Transformer (ViT) with bidirectional Gated Recurrent Units (GRUs) to capture spatio-temporal covariance for predicting sea surface currents (U, V) using high-frequency radar (HF) data. The name SEA-ViT is derived from Sea Surface Currents Forecasting using Vision Transformer, highlighting the model’s emphasis on ocean dynamics and its use of the ViT architecture to enhance forecasting capabilities. SEA-ViT is designed to unravel complex dependencies by leveraging a rich dataset spanning over 30 years and incorporating ENSO indices (El Niño, La Niña, and neutral phases) to address the intricate relationship between geographic coordinates and climatic variations. This development enhances the predictive capabilities for sea surface currents, supporting the efforts of the Geo-Informatics and Space Technology Development Agency (GISTDA) in Thailand’s maritime regions. The code and pretrained models are available at https://github.com/kaopanboonyuen/gistda-ai-sea-surface-currents.

Teerapong Panboonyuen, N. Rattanachona, P. Thungthin, N. Subsompon, S. Thongbai, W. Wongweeranimit, R. Phukham

2024 In 5th International Conference on Highway Engineering ICHE 2024

REG: Refined Generalized Focal Loss for Road Asset Detection on Thai Highways Using Vision-Based Detection and Segmentation Models

This paper dives into the cutting-edge world of road asset detection on Thai highways, showcasing a novel approach that combines an upgraded REG model with Generalized Focal Loss. Our focus is on identifying key road elements—like pavilions, pedestrian bridges, information and warning signs, and concrete guardrails—to boost road safety and infrastructure management. While deep learning methods have shown promise, traditional models often struggle with accuracy in tricky conditions, such as cluttered backgrounds and variable lighting. To tackle these issues, we’ve integrated REG with Generalized Focal Loss, enhancing its ability to detect road assets with greater precision. Our results are impressive, the REGx model led the way with a mAP50 of 80.340, mAP50-95 of 60.840, precision of 79.100, recall of 76.680, and an F1-score of 77.870. These findings highlight the REGx model’s superior performance, demonstrating the power of advanced deep learning techniques to improve highway safety and infrastructure maintenance, even in challenging conditions.

Teerapong Panboonyuen, C. Charoenphon, C. Satirapod

2023 In Remote Sensing

MeViT: A Medium-Resolution Vision Transformer for Semantic Segmentation on Landsat Satellite Imagery for Agriculture in Thailand

In this paper, we present MeViT (Medium-Resolution Vision Transformer), designed for semantic segmentation of Landsat satellite imagery, focusing on key economic crops in Thailand para rubber, corn, and pineapple. MeViT enhances Vision Transformers (ViTs) by integrating medium-resolution multi-branch architectures and revising mixed-scale convolutional feedforward networks (MixCFN) to extract multi-scale local information. Extensive experiments on a public Thailand dataset demonstrate that MeViT outperforms state-of-the-art deep learning methods, achieving a precision of 92.22%, recall of 94.69%, F1 score of 93.44%, and mean IoU of 83.63%. These results highlight MeViT’s effectiveness in accurately segmenting Thai Landsat-8 data.

Teerapong Panboonyuen, N. Nithisopa, P. Pienroj, L. Jirachuphun, C. Watthanasirikrit, N. Pornwiriyakul

2023 In Image Analysis and Processing ICIAP 2023

MARS: Mask Attention Refinement with Sequential Quadtree Nodes for Car Damage Instance Segmentation

Evaluating car damages is crucial for the car insurance industry, but current deep learning networks fall short in accuracy due to inadequacies in handling car damage images and producing fine segmentation masks. This paper introduces MARS (Mask Attention Refinement with Sequential quadtree nodes) for instance segmentation of car damages. MARS employs self-attention mechanisms to capture global dependencies within sequential quadtree nodes and a quadtree transformer to recalibrate channel weights, resulting in highly accurate instance masks. Extensive experiments show that MARS significantly outperforms state-of-the-art methods like Mask R-CNN, PointRend, and Mask Transfiner on three popular benchmarks, achieving a +1.3 maskAP improvement with the R50-FPN backbone and +2.3 maskAP with the R101-FPN backbone on the Thai car-damage dataset. Demos are available at https://github.com/kaopanboonyuen/MARS.

T. Vajeethaveesin, Teerapong Panboonyuen

2022 In Trends in Sciences (Trends Sci. or TiS)

A Performance Comparison between GIS-based and Neuron Network Methods for Flood Susceptibility Assessment in Ayutthaya Province

Flooding poses a significant challenge in Thailand due to its complex geography, traditionally addressed through GIS methods like the Flood Risk Assessment Model (FRAM) combined with the Analytical Hierarchy Process (AHP). This study assesses the efficacy of Artificial Neural Networks (ANN) in flood susceptibility mapping, using data from Ayutthaya Province and incorporating 5-fold cross-validation and Stochastic Gradient Descent (SGD) for training. ANN achieved superior performance with precision of 79.90%, recall of 79.04%, F1-score of 79.08%, and accuracy of 79.31%, outperforming the traditional FRAM approach. Notably, ANN identified that only three factors—flow accumulation, elevation, and soil types—were crucial for predicting flood-prone areas. This highlights the potential for ANN to simplify and enhance flood risk assessments. Moreover, the integration of advanced machine learning techniques underscores the evolving capability of AI in addressing complex environmental challenges.

Teerapong Panboonyuen

2020 In Chulalongkorn University Thesis Evaluation - Very Good Score (Outstanding Achievement)

Semantic Segmentation on Remotely Sensed Images Using Deep Convolutional Encoder-Decoder Neural Network

My PhD thesis focuses on improving semantic segmentation of aerial and satellite images, a crucial task for applications like agriculture planning, map updates, route optimization, and navigation. Current models like the Deep Convolutional Encoder-Decoder (DCED) have limitations in accuracy due to their inability to recover low-level features and the scarcity of training data. To address these issues, I propose a new architecture with five key enhancements, a Global Convolutional Network (GCN) for improved feature extraction, channel attention for selecting discriminative features, domain-specific transfer learning to address data scarcity, Feature Fusion (FF) for capturing low-level details, and Depthwise Atrous Convolution (DA) for refining features. Experiments on Landsat-8 datasets and the ISPRS Vaihingen benchmark showed that my proposed architecture significantly outperforms the baseline models in remote sensing imagery.

I. Wichakam, Teerapong Panboonyuen, C. Udomcharoenchaikit, P. Vateekul

2018 In International Conference on Multimedia Modeling MMM 2018

Real-Time Polyps Segmentation for Colonoscopy Video Frames Using Compressed Fully Convolutional Network

Colorectal cancer is one of the leading causes of cancer death worldwide. As of now, colonoscopy is the most effective screening tool for diagnosing colorectal cancer by searching for polyps which can develop into colon cancer. The drawback of manual colonoscopy process is its high polyp miss rate. Therefore, polyp detection is a crucial issue in the development of colonoscopy application. Despite having high evaluation scores, the recently published methods based on fully convolutional network (FCN) require a very long inferring (testing) time that cannot be applied in a real clinical process due to a large number of parameters in the network. In this paper, we proposed a compressed fully convolutional network by modifying the FCN-8s network, so our network is able to detect and segment polyp from video images within a real-time constraint in a practical screening routine. Furthermore, our customized loss function allows our network to be more robust when compared to the traditional cross-entropy loss function. The experiment was conducted on CVC-EndoSceneStill database which consists of 912 video frames from 36 patients. Our proposed framework has obtained state-of-the-art results while running more than 7 times faster and requiring fewer weight parameters by more than 9 times. The experimental results convey that our system has the potential to support clinicians during the analysis of colonoscopy video by automatically indicating the suspicious polyps locations.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2017 In Remote Sensing

Road segmentation of remotely-sensed images using deep convolutional neural networks with landscape metrics and conditional random fields

Semantic segmentation of remotely-sensed aerial (or very-high resolution, VHS) images and satellite (or high-resolution, HR) images has numerous application domains, particularly in road extraction, where the segmented objects serve as essential layers in geospatial databases. Despite several efforts to use deep convolutional neural networks (DCNNs) for road extraction from remote sensing images, accuracy remains a challenge. This paper introduces an enhanced DCNN framework specifically designed for road extraction from remote sensing images by incorporating landscape metrics (LMs) and conditional random fields (CRFs). Our framework employs the exponential linear unit (ELU) activation function to improve the DCNN, leading to a higher quantity and more accurate road extraction. Additionally, to minimize false classifications of road objects, we propose a solution based on the integration of LMs. To further refine the extracted roads, a CRF method is incorporated into our framework. Experiments conducted on Massachusetts road aerial imagery and Thailand Earth Observation System (THEOS) satellite imagery datasets demonstrated that our proposed framework outperforms SegNet, a state-of-the-art object segmentation technique, in most cases regarding precision, recall, and F1 score across various types of remote sensing imagery.

Publications

To find relevant content, try searching publications, filtering using the buttons below, or exploring popular topics. A * denotes equal contribution.

GuidedBox: A segmentation-guided box teacher-student approach for weakly supervised road segmentation

Teerapong Panboonyuen, C. Charoenphon, C. Satirapod

2025 In European Journal of Remote Sensing (Taylor & Francis)

CU-ICU: Customizing Unsupervised Instruction-Finetuned Language Models for ICU Datasets via Text-to-Text Transfer Transformer

Teerapong Panboonyuen

2025 In the 14th Critical Care Conference

SatDiff: A Stable Diffusion Framework for Inpainting Very High-Resolution Satellite Imagery

Teerapong Panboonyuen, C. Charoenphon, C. Satirapod

2025 In IEEE Access

Investigating the use of deep learning-derived weighted mean temperature for GPS-PWVs estimation

C. Charoenphon, Teerapong Panboonyuen, B. Zhang, C. Satirapod

2025 In Journal of Spatial Science (Taylor & Francis)

ALBERT: Advanced Localization and Bidirectional Encoder Representations from Transformers for Automotive Damage Evaluation

Teerapong Panboonyuen

2025 In arXiv:2506.10524 [cs.CV]

SLICK: Selective Localization and Instance Calibration for Knowledge-Enhanced Car Damage Segmentation in Automotive Insurance

Teerapong Panboonyuen

2025 In arXiv:2506.10528 [cs.CV]

DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation

In this paper, we present a novel end-to-end framework that integrates ResNet and Vision Transformer (ViT) backbones with cutting-edge techniques such as Deformable Convolutions, Retrieval-Augmented Generation, and Conditional Random Fields (CRF). These innovations work together to significantly improve feature representation and Optical Character Recognition (OCR) performance. By replacing the standard convolution layers in the third and fourth blocks with Deformable Convolutions, the framework adapts more flexibly to complex text layouts, while adaptive dropout helps prevent overfitting and enhance generalization. Moreover, incorporating CRFs refines the sequence modeling for more accurate text recognition. Extensive experiments on six benchmark datasets—IC13, IC15, SVT, IIIT5K, SVTP, and CUTE80—demonstrate the framework’s exceptional performance. Our method represents a significant leap forward in OCR technology, addressing challenges in recognizing text with various distortions, fonts, and orientations. The framework has proven not only effective in controlled conditions but also adaptable to more complex, real-world scenarios. The code for this framework is available at https://github.com/kaopanboonyuen/DOTA.

N. Nithisopa, Teerapong Panboonyuen

2025 In 17th International Conference on Knowledge and Smart Technology (KST2025)

DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation

ENGRU: A Preliminary Investigation of AI-Augmented Formal Verification and Its Challenges

State-space graphs and automata are essential for modeling and analyzing computational systems. Recurrent neural networks (RNNs) underpin language models by processing sequential data and capturing contextual dependencies. Both RNNs and state-space graphs evaluate discrete-time systems, but their equivalence, especially in sentence structure modeling, remains unresolved. This paper introduces ENGRU (Enhanced Gated Recurrent Units), a deep learning approach for formal verification. ENGRU combines model checking, Colored Petri Nets (CPNs), and sequential learning to analyze systems abstractly. CPNs undergo state-space enumeration to generate graphs and automata, which are transformed into sequential representations for ENGRU to learn and predict system behaviors. ENGRU effectively predicts goal states in discrete-time models, aiding early bug detection and predictive state-space exploration. Experimental results show high accuracy and efficiency in goal state predictions. ENGRU’s source code is available at https://github.com/kaopanboonyuen/ENGRU.

C. Dechsupa, Teerapong Panboonyuen, W. Vatanawood

2025 In IEEE Access

ENGRU: A Preliminary Investigation of AI-Augmented Formal Verification and Its Challenges

SEA-ViT: Sea Surface Currents Forecasting Using Vision Transformer and GRU-Based Spatio-Temporal Covariance Modeling

Teerapong Panboonyuen

2024 In 17th International Conference on Knowledge and Smart Technology (KST2025)

REG: Refined Generalized Focal Loss for Road Asset Detection on Thai Highways Using Vision-Based Detection and Segmentation Models

Teerapong Panboonyuen, N. Rattanachona, P. Thungthin, N. Subsompon, S. Thongbai, W. Wongweeranimit, R. Phukham

2024 In 5th International Conference on Highway Engineering ICHE 2024

MeViT: A Medium-Resolution Vision Transformer for Semantic Segmentation on Landsat Satellite Imagery for Agriculture in Thailand

Teerapong Panboonyuen, C. Charoenphon, C. Satirapod

2023 In Remote Sensing

MARS: Mask Attention Refinement with Sequential Quadtree Nodes for Car Damage Instance Segmentation

Teerapong Panboonyuen, N. Nithisopa, P. Pienroj, L. Jirachuphun, C. Watthanasirikrit, N. Pornwiriyakul

2023 In Image Analysis and Processing ICIAP 2023

Object Detection of Road Assets Using Transformer-Based YOLOX with Feature Pyramid Decoder on Thai Highway Panorama

Detecting objects of varying sizes, like kilometer stones, remains a significant challenge and directly affects the accuracy of object counts. Transformers have shown remarkable success in natural language processing (NLP) and image processing due to their ability to model long-range dependencies. This paper proposes an enhanced YOLO (You Only Look Once) series with two key contributions, (i) We employ a pre-training objective to obtain original visual tokens from image patches of road assets, using a pre-trained Vision Transformer (ViT) backbone, which is then fine-tuned on downstream tasks with additional task layers. (ii) We incorporate Feature Pyramid Network (FPN) decoder designs into our deep learning network to learn the significance of different input features, avoiding issues like feature mismatch and performance degradation that arise from simple summation or concatenation. Our proposed method, Transformer-Based YOLOX with FPN, effectively learns general representations of objects and significantly outperforms state-of-the-art detectors, including YOLOv5S, YOLOv5M, and YOLOv5L. It achieves a 61.5% AP on the Thailand highway corpus, surpassing the current best practice (YOLOv5L) by 2.56% AP on the test-dev dataset.

Teerapong Panboonyuen, S. Thongbai, W. Wongweeranimit, P. Santitamnont, K. Suphan, C. Charoenphon

2022 In Information

Object Detection of Road Assets Using Transformer-Based YOLOX with Feature Pyramid Decoder on Thai Highway Panorama

Quality of Life Prediction in Driving Scenes on Thailand Roads Using Information Extraction from Deep Convolutional Neural Networks

In today’s world, urban design and sustainable development are crucial for megacities, impacting residents’ wellbeing. Quality of Life (QOL) is a key performance indicator (KPI) used to measure the effectiveness of city planning. Traditionally, QOL is assessed through costly and time-consuming surveys, but our AI-based approach offers a more efficient solution. Using Bangkok as a case study, we apply deep convolutional neural networks (DCNNs) for semantic segmentation and object detection to gather relevant image data. Then, we use linear regression to infer QOL scores. Our method, tested with state-of-the-art models and public datasets, proves to be a practical alternative for QOL assessment, with implementation codes and datasets available at https://kaopanboonyuen.github.io/bkkurbanscapes.

K. Thitisiriwech, Teerapong Panboonyuen, P. Kantavat, Y. Iwahori, B. Kijsirikul

2022 In Sustainability

Quality of Life Prediction in Driving Scenes on Thailand Roads Using Information Extraction from Deep Convolutional Neural Networks

A Performance Comparison between GIS-based and Neuron Network Methods for Flood Susceptibility Assessment in Ayutthaya Province

T. Vajeethaveesin, Teerapong Panboonyuen

2022 In Trends in Sciences (Trends Sci. or TiS)

Enhanced Feature Pyramid Vision Transformer for Semantic Segmentation on Thailand Landsat-8 Corpus

Semantic segmentation on Landsat-8 data is crucial in the integration of diverse data, allowing researchers to achieve more productivity and lower expenses. This research aimed to improve the versatile backbone for dense prediction without convolutions—namely, using the pyramid vision transformer (PRM-VS-TM) to incorporate attention mechanisms across various feature maps. Furthermore, the PRM-VS-TM constructs an end-to-end object detection system without convolutions and uses handcrafted components, such as dense anchors and non-maximum suspension (NMS). The present study was conducted on a private dataset, i.e., the Thailand Landsat-8 challenge. There are three baselines, DeepLab, Swin Transformer (Swin TF), and PRM-VS-TM. Results indicate that the proposed model significantly outperforms all current baselines on the Thailand Landsat-8 corpus, providing F1-scores greater than 80% in almost all categories. Finally, we demonstrate that our model, without utilizing pre-trained settings or any further post-processing, can outperform current state-of-the-art (SOTA) methods for both agriculture and forest classes.

Teerapong Panboonyuen, P. Rakwatin, K. Intarat

2020 In Information

Enhanced Feature Pyramid Vision Transformer for Semantic Segmentation on Thailand Landsat-8 Corpus

The Bangkok Urbanscapes Dataset for Semantic Urban Scene Understanding Using Enhanced Encoder-Decoder with Atrous Depthwise Separable A1 Convolutional Neural Networks

This paper addresses semantic segmentation for autonomous driving systems, focusing on self-driving cars in Thailand. We introduce DeepLab-V3-A1 with Xception, an enhanced version of DeepLab-V3+, and present the Bangkok Urbanscapes dataset. Our method improves segmentation accuracy by refining the decoder and modifying the Xception backbone. Experiments on four datasets, including CamVid, Cityscapes, IDD, and our proposed dataset, show our approach performs comparably to baseline methods. Our dataset includes 701 annotated images of various Bangkok driving environments, covering eleven semantic classes. The architecture and dataset aim to aid developers in improving autonomous driving systems for diverse urban conditions. Implementation codes and dataset are available at https://kaopanboonyuen.github.io/bkkurbanscapes.

K. Thitisiriwech, Teerapong Panboonyuen, P. Kantavat, Y. Iwahori, B. Kijsirikul

2020 In IEEE Access

The Bangkok Urbanscapes Dataset for Semantic Urban Scene Understanding Using Enhanced Encoder-Decoder with Atrous Depthwise Separable A1 Convolutional Neural Networks

Transformer-Based Decoder Designs for Semantic Segmentation on Remotely Sensed Images

Transformers have demonstrated remarkable accomplishments in several natural language processing (NLP) tasks as well as image processing tasks. Herein, we present a deep-learning (DL) model that is capable of improving the semantic segmentation network in two ways. First, utilizing the pre-training Swin Transformer (SwinTF) under Vision Transformer (ViT) as a backbone, the model weights downstream tasks by joining task layers upon the pretrained encoder. Secondly, decoder designs are applied to our DL network with three decoder designs, U-Net, pyramid scene parsing (PSP) network, and feature pyramid network (FPN), to perform pixel-level segmentation. The results are compared with other image labeling state of the art (SOTA) methods, such as global convolutional network (GCN) and ViT. Extensive experiments show that our Swin Transformer (SwinTF) with decoder designs reached a new state of the art on the Thailand Isan Landsat-8 corpus (89.8% 𝐹1 score), Thailand North Landsat-8 corpus (63.12% 𝐹1 score), and competitive results on ISPRS Vaihingen. Moreover, both our best-proposed methods (SwinTF-PSP and SwinTF-FPN) even outperformed SwinTF with supervised pre-training ViT on the ImageNet-1K in the Thailand, Landsat-8, and ISPRS Vaihingen corpora.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2020 In Remote Sesning

Transformer-Based Decoder Designs for Semantic Segmentation on Remotely Sensed Images

Semantic Segmentation on Remotely Sensed Images Using Deep Convolutional Encoder-Decoder Neural Network

Teerapong Panboonyuen

2020 In Chulalongkorn University Thesis Evaluation - Very Good Score (Outstanding Achievement)

Semantic Labeling in Remote Sensing Corpora Using Feature Fusion-Based Enhanced Global Convolutional Network with High-Resolution Representations and Depthwise Atrous Convolution

This paper addresses improving semantic segmentation in remote sensing for aerial and satellite images, which is crucial for agriculture, map updates, route optimization, and navigation. We propose enhancements to the state-of-the-art Enhanced Global Convolutional Network (GCN152-TL-A) by introducing a High-Resolution Representation (HR) backbone for better feature extraction, Feature Fusion (FF) to capture low-level details, and Depthwise Atrous Convolution (DA) for refined multi-resolution features. Experiments on Landsat-8 and ISPRS Vaihingen datasets demonstrate our model’s superior performance, achieving over 90% accuracy in F1 scores and outperforming baseline models.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2019 In Remote Sensing

Semantic Labeling in Remote Sensing Corpora Using Feature Fusion-Based Enhanced Global Convolutional Network with High-Resolution Representations and Depthwise Atrous Convolution

Semantic Segmentation on Remotely Sensed Images Using an Enhanced Global Convolutional Network with Channel Attention and Domain Specific Transfer Learning

In the remote sensing domain, it is crucial to complete semantic segmentation on the raster images, e.g., river, building, forest, etc., on raster images. A deep convolutional encoder–decoder (DCED) network is the state-of-the-art semantic segmentation method for remotely sensed images. However, the accuracy is still limited, since the network is not designed for remotely sensed images and the training data in this domain is deficient. In this paper, we aim to propose a novel CNN for semantic segmentation particularly for remote sensing corpora with three main contributions. First, we propose applying a recent CNN called a global convolutional network (GCN), since it can capture different resolutions by extracting multi-scale features from different stages of the network. Additionally, we further enhance the network by improving its backbone using larger numbers of layers, which is suitable for medium resolution remotely sensed images. Second, “channel attention” is presented in our network in order to select the most discriminative filters (features). Third, “domain-specific transfer learning” is introduced to alleviate the scarcity issue by utilizing other remotely sensed corpora with different resolutions as pre-trained data. The experiment was then conducted on two given datasets (i) medium resolution data collected from Landsat-8 satellite and (ii) very high resolution data called the ISPRS Vaihingen Challenge Dataset. The results show that our networks outperformed DCED in terms of 𝐹1 for 17.48% and 2.49% on medium and very high resolution corpora, respectively.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2019 In Remote Sesning

Semantic Segmentation on Remotely Sensed Images Using an Enhanced Global Convolutional Network with Channel Attention and Domain Specific Transfer Learning

Transportation Mobility Factor Extraction Using Image Recognition Techniques

For an urban development, the Quality of Life (QOL) of people in the city is a vital issue that should be considered. There are many researches in QOL topics that use questionnaire survey approach. These studies yield very useful information for city development planning. As the Artificial Intelligence technologies are developed very fast recently, they are applied to solve many transportation problems. In this paper, we propose a method that automatically extract mobility indicators using two image recognition techniques, Semantic Segmentation and Object Recognition. Because the mobility is an important factor in QOL evaluation, our work can be used to enhance a performance and reduce a data gathering cost of the QOL evaluation.

P. Kantavat, Y. Hayashi, G. City, B. Kijsirikul, Teerapong Panboonyuen, Y. Iwahori

2019 In First International Conference on Smart Technology & Urban Development Best Paper; (STUD 2019)

Transportation Mobility Factor Extraction Using Image Recognition Techniques

Real-Time Polyps Segmentation for Colonoscopy Video Frames Using Compressed Fully Convolutional Network

I. Wichakam, Teerapong Panboonyuen, C. Udomcharoenchaikit, P. Vateekul

2018 In International Conference on Multimedia Modeling MMM 2018

Semantic Segmentation On Medium-Resolution Satellite Images Using Deep Convolutional Networks With Remote Sensing Derived Indices

Semantic Segmentation is a fundamental task in computer vision and remote sensing imagery. Many applications, such as urban planning, change detection, and environmental monitoring, require the accurate segmentation; hence, most segmentation tasks are performed by humans. Currently, with the growth of Deep Convolutional Neural Network (DCNN), there are many works aiming to find the best network architecture fitting for this task. However, all of the studies are based on very-high resolution satellite images, and surprisingly; none of them are implemented on medium resolution satellite images. Moreover, no research has applied geoinformatics knowledge. Therefore, we purpose to compare the semantic segmentation models, which are FCN, SegNet, and GSN using medium resolution images from Landsat-8 satellite. In addition, we propose a modified SegNet model that can be used with remote sensing derived indices. The results show that the model that achieves the highest accuracy RGB bands of medium resolution aerial imagery is SegNet. The overall accuracy of the model increases when includes Near Infrared (NIR) and Short-Wave Infrared (SWIR) band. The results showed that our proposed method (our modified SegNet model, named RGB-IR-IDX-MSN method) outperforms all of the baselines in terms of mean F1 scores.

S. Chantharaj, K. Pornratthanapong, P. Chitsinpchayakun, Teerapong Panboonyuen

2018 In *15th International Joint Conference on Computer Science and Software Engineering * JCSSE 2018

Semantic Segmentation On Medium-Resolution Satellite Images Using Deep Convolutional Networks With Remote Sensing Derived Indices

Road segmentation of remotely-sensed images using deep convolutional neural networks with landscape metrics and conditional random fields

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2017 In Remote Sensing

An enhanced deep convolutional encoder-decoder network for road segmentation on aerial imagery

In this paper, we introduce an improved deep convolutional encoder-decoder network (DCED) for segmenting road objects from aerial images. Enhancements include the use of ELU (exponential linear unit) instead of ReLU, dataset augmentation with incrementally-rotated images to increase training data by eight times, and the use of landscape metrics to remove false road objects. Tested on the Massachusetts Roads dataset, our method outperformed the SegNet benchmark and other baselines in precision, recall, and F1 scores.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2017 In International Conference on Computing and Information Technology (IC2IT) Best Student Paper Honourable Mention (top 0.26% of submissions)

An enhanced deep convolutional encoder-decoder network for road segmentation on aerial imagery

Road map extraction from satellite imagery using connected component analysis and landscape metrics

Road map extraction is vital for GIS and underpins many location-based applications like GPS navigation, delivery route planning, tourist attraction locating, and location-based marketing. This research uses satellite imagery, though other remotely sensed images like aerial photographs, UAVs, or drones are also applicable. Despite various proposed methods focusing primarily on accuracy, completeness of results is equally important. We enhance accuracy by incorporating connected component analysis and improve completeness using landscape metrics, which describe spatial characteristics through shape and isolation indices. Evaluated on precision, recall, quality, and F1 scores, our method achieves over 90% performance in all criteria.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2017 In IEEE International Conference on Big Data Big Data 2017

Road map extraction from satellite imagery using connected component analysis and landscape metrics

Image Vectorization of Road Satellite Data Sets

Data extraction of geo-spatial objects from satellite images is a crucial step in facilitating analysis of geo-spatial or spatio-temporal data, typically involving line (road) and polygon (area) layers. This paper introduces a method for transforming satellite data (raster images) containing roads from pixel form into spatial objects comprising lines and polygons. Our algorithm consists of three primary steps. First, roads are isolated from other objects using k-means clustering. Second, line extraction is performed on the road areas by applying morphological operations to skeletonize the image, followed by enhancement using the Ramer-Douglas-Peucker algorithm. Finally, land-cover classification is applied to non-road objects to extract polygons. Experimental results demonstrate that both lines (road networks) and polygons (areas) can be accurately extracted from satellite imagery simultaneously.

Teerapong Panboonyuen, P. Vateekul, S. Lawawirojwong

2016 In Journal of Remote Sensing and GIS Association of Thailand RESGAT

Image Vectorization of Road Satellite Data Sets

Featured Talks

TSCCM2025 (The 14th Critical Care Conference)

I was honored to be selected—among only 8 individuals—for an oral presentation at the 14th Critical Care Conference. My talk focused on CU-ICU, a Thai-language instruction-tuned model designed to assist ICU practitioners. Built on the T5 architecture, the model was fine-tuned using efficient techniques such as LoRA, AdaLoRA, and IA3. CU-ICU integrates evidence-based clinical guidelines, including those from the Surviving Sepsis Campaign, to support clinical reasoning.

2025 1:30 PM Bhumisiri Mangkhalanusorn Building, King Chulalongkorn Memorial Hospital

TSCCM2025 (The 14th Critical Care Conference)

UAMC2025 (Applied Mathematics Conference)

I was invited to give a talk on the transformative impact of Vision Transformers (ViTs) in the car insurance industry. The session explored the mathematical foundations of ViTs, highlighting how the self-attention mechanism enables precise visual data analysis. I also examined the use of custom loss functions to enhance claim prediction accuracy and discussed the integration of Large Language Models (LLMs) to further advance AI capabilities.

2025 1:00 PM King Mongkut's Institute of Technology Ladkrabang (KMITL)

UAMC2025 (Applied Mathematics Conference)

NAC2025 (NSTDA Annual Conference 2025)

I was invited by NSTDA to teach a session focused on advanced artificial intelligence and computer vision, using the iconic “Where’s Waldo” problem as a practical case study. The session explored how AI can be trained to detect complex visual patterns—not just identifying a character in a busy scene, but demonstrating the power of modern deep learning techniques. I explored cutting-edge techniques like Vision Transformers to develop models with exceptional precision in image recognition, showcasing how these advanced architectures are redefining the limits of AI perception and understanding.

2025 9:30 AM National Science and Technology Development Agency (NSTDA)

Young Scientists Quickfire Pitch

In this quick pitch, I’m thrilled to introduce MeViT—a medium-resolution Vision Transformer developed for high-precision semantic segmentation of Landsat satellite imagery.

2024 6:30 PM National University of Singapore, Singapore

Exploring Careers as an AI Research Scientist

I recently had the opportunity to speak with high school students about ‘Career Paths for AI Research Scientists.’ During the talk, I shared my experiences as a postdoctoral researcher in AI, diving into the exciting world of artificial intelligence. I discussed the various career opportunities in the field, from academic research to industry roles, and highlighted how my own journey has shaped my understanding of Generative AI.

2024 9:05 AM NSTDA, Pathum Thani, Thailand

Exploring Careers as an AI Research Scientist

Inspiring the Future of AI Innovations

I had the opportunity to give a final orientation speech to the undergraduate students of the Department of Electrical and Computer Engineering at KMUTNB. The focus of my speech was on the transformative impact of AI, particularly highlighting the advancements in Large Language Models (LLMs) like ChatGPT. I discussed how these models have revolutionized natural language processing, enabling sophisticated interactions and problem-solving capabilities.

2024 9:09 AM ECE KMUTNB, BKK

Geospatial Big Data Analytics

I was invited to present on advanced geospatial data analytics, focusing on how spatial and geographic information can be transformed into actionable insights. Leveraging PySpark for distributed computing significantly accelerates data processing, allowing efficient handling of large-scale geospatial datasets. The course also introduced distributed machine learning techniques to build scalable, high-performance predictive models.

2023 8:15 AM Geo-Informatics and Space Technology Development Agency (GISTDA)

Invited to Italy for ICIAP2023

I was invited to Italy to present my research, “MARS, Mask Attention Refinement with Sequential Quadtree Nodes,” at the ICIAP 2023 Workshop. This prestigious conference, organized biennially by CVPL under the International Association for Pattern Recognition (IAPR), will unite experts to discuss advancements in car insurance and computer vision. My research addresses the challenge of accurately evaluating car damages using MARS, which enhances instance segmentation through self-attention mechanisms and quadtree transformers.

2023 10:30 AM University of Udine, Italy

Distributed ML for Geospatial Data

I was invited to teach a course on distributed machine learning to the Geo-Informatics and Space Technology Development Agency (GISTDA). The curriculum covered fundamental concepts of PySpark, basic deep learning techniques, and practical applications of distributed training using TensorFlow.

2022 8:45 AM Geo-Informatics and Space Technology Development Agency (GISTDA)

Achieve Data Science First Meet

I was invited to speak at the “Achieve Data Science First Meet” for a MOOC student project event, where I highlighted the growing recognition of data science, AI, and machine learning’s importance across various industries. I advised that organizations, regardless of their size or sector, must effectively develop and implement data science capabilities to stay competitive in the era of big data, or risk falling behind.

2020 9:30 AM Victor Club, Samyan Mitrtown, BKK

See all events

Research Communities

Visiting Faculty - College of Computing, Khon Kaen University
- June 2023 - Present
- Teach courses:
  - SC310005 Artificial Intelligence and Machine Learning Application: Introduction to AI and ML concepts and their applications.
  - CP020002 Smart Process Management: Techniques for optimizing and automating business processes.
  - SC320002 Business Intelligence: Methods for data analysis and decision-making in business contexts.
  - CP020001 Introduction to Computers and Programming: Basics of computer systems and introductory programming skills.
  - DE200001 Fundamentals of Data Engineering: Introduction to data engineering concepts and fundamental tools for beginners.
- Ministerial Order on the Appointment of Academic Staff (Order 5907-2566)
- Invitation Letter for a Special Lecturer Position (Order อว 660101.26/9304)
- Invitation Letter for a Special Lecturer Position (Order อว 660101.26/24844)
- Invitation Letter for a Special Lecturer Position (Order อว 660101.26/13320)
Guest Lecturer and AI Committee Member
- Keynote Speaker - UAMC2025
  - Delivered keynote on “Mathematical Foundations of Vision Transformers in Car Insurance AI.”
  - Poster: UAMC2025
- AI Instructor - Department of Lands, Thailand (2025)
  - Taught Large Language Models (LLMs) using land title deed data and demonstrated AI-driven automation for creating land deeds.
  - Code and Lecture Slides
- AI Instructor - Office of the Cane and Sugar Board (2025)
  - Leveraged Vision Transformer model for accurate sugarcane area classification and boundary detection from satellite images.
- NSTDA One Day Camp at Sirindhorn Science Home (2024)
  - Talking about career opportunities and becoming a research scientist in AI as part of the GYSS2025 scholarship program.
  - Full Blog and Slide: Career Paths for AI Research Scientists
- 2108421 Modern Integrated Survey Technology (MIST) - Chulalongkorn University
  - Guided students in applying Machine Learning to survey engineering.
- CP411701 AI Inspiration Course - Khon Kaen University
  - Delivered a lecture on “Generative AI: Current Trends and Practical Applications” at the College of Computing, Khon Kaen University.
  - Slide: Generative AI
- The 7th KVIS Invitational Science Fair
  - Served as a committee member for the AI project at Kamnoetvidya Science Academy, Rayong, Thailand (29 January - 2 February 2024).
- Industrial Advisory Board (IAB) - ECE KMUTNB
  - Contributed to curriculum assessment and provided comments for the Department of Electrical and Computer Engineering (ECE).
- AI and ML Instructor - Nomklao Kunnathi Demonstration School
  - Taught AI and ML under the Design Graphics Science and Technology Learning Group for high school students (Grade 10) in the Science and Mathematics Curriculum Plan.
- Deep Learning Instructor - Thammasat University
  - Conducted training on satellite data processing and interpretation for advanced military and disaster missions at the Faculty of Liberal Arts.
- Senior Project Advisor - Thammasat University
  - Advised students on senior projects in the Department of Geography, Faculty of Liberal Arts.
- AI Instructor - Department of Lands, Thailand
  - Taught AI using land title deed data. Code and Lecture Slides
AI-Powered Image Recognition for Transportation Mobility Factors: A Quality of Life Perspective for Bangkok City
- Urban development hinges on improving the Quality of Life (QOL) for city inhabitants. Traditionally, QOL assessments rely heavily on questionnaire surveys, which, while informative, can be costly and time-consuming.
- Project (GitHub Page), Paper, Code, Cite
Medium-Resolution Vision Transformer for Semantic Segmentation on Landsat Satellite Imagery in Thailand
- This project introduces MeViT (Medium-Resolution Vision Transformer), a novel approach tailored for Landsat satellite imagery of key economic crops in Thailand, including para rubber, corn, and pineapple.
- Project (GitHub Page), Paper, Code, Cite
Flood Risk Assessment in Ayutthaya Province
- This project explores a variety of models, including Random Forest, Gradient Boosting, and Neural Networks, to build a predictive model using relevant features from the dataset.
- Project (GitHub Page), Paper, Code, Cite
The Bangkok Urbanscapes Dataset for Semantic Urban Scene Understanding Using Deep Learning
- To further study self-driving cars in Thailand, we provide both the proposed methods and the proposed dataset in this project. We hope that our architecture and our dataset would help self-driving autonomous developers improve systems for driving in many cities with unique traffic and driving conditions similar to Bangkok and elsewhere in Thailand.
- Project (GitHub Page), Dataset, Paper, Code, Cite

Teerapong Panboonyuen

Senior AI Research Scientist, PostDoc Fellow

MARSAIL, Chula

Selected Awards

Selected Press

Featured Publications

Publications

Popular Topics

Featured Talks

Research Communities