HOME > Research

Agentic AI for Multi-Source PM2.5 Pollution Analysis and Location-Based Mitigation Strategies: A Case Study in Thailand and Japan

Introduction

The Problem: Seasonal Haze in Northern Thailand

Provinces like Chiang Mai and Mae Hong Son face extreme $PM_{2.5}$ levels, often exceeding $200\text{–}300\mu g/m^3$ during the burning season (January–April). This leads to a 20–30% spike in hospital admissions for respiratory issues. Existing forecasting systems are hindered by sparse monitoring networks and a lack of physical grounding, often failing to provide actionable early warnings.

The Solution: An Integrated AI Playground

This project delivers an interactive environment that performs two critical tasks:

Forward Forecasting: Predicting $PM_{2.5}$ levels one hour ahead on a spatial grid.
Inverse Source Identification: Tracing predicted pollution back to specific origins like hotspots, factories, or cross-border smoke.

Key Features

Hybrid SVR-PICNN Architecture: Combines Support Vector Regression (SVR) for temporal trends with Physics-Informed Convolutional Neural Networks (PICNN) for spatial accuracy.
Physics-Informed Constraints: The model enforces the Advection–Diffusion–Reaction (ADR) equation, ensuring predictions follow the laws of atmospheric physics.
Inverse Source Search: Utilizes DeepONet and Graph Attention Networks (GAT) to discover hidden emission sources from sparse sensor data.
Multi-Source Data Pipeline: Integrates hourly $PM_{2.5}$ measurements, NASA FIRMS fire hotspots, factory inventories (DIW), and vehicle statistics (DLT).
Robust Data Imputation: Employs PCHIP and linear interpolation to maintain a continuous hourly data stream despite sensor outages.

Development and Innovation

The framework is built on a "Forward-Reverse" dual engine that balances raw data with physical reality.

1. Forward Forecasting (The Predictive Engine)

The system uses a parallel model structure:

SVR Component: Handles the coarse-grained time-series prediction of $PM_{2.5}$ concentrations.
PICNN Component: Refines the spatial distribution. Because it is "physics-informed," it penalizes predictions that violate the ADR equation, which describes how pollutants move (advection) and spread (diffusion).

2. Inverse Modeling (The Forensic Engine)

To identify where pollution comes from, the team introduced a DeepONet Graph Attention PINN.

Graph Neural Networks (GNN): Ideal for Thailand's irregular monitoring network, where stations are scattered unevenly.
DeepONet: Maps input functions (like wind fields) to output functions (concentration maps), allowing the system to generalize across different weather scenarios without retraining.

Results and Impact

Performance Validation

Experiments using real-world data from nine monitoring stations revealed that the parallel architecture (SVR and PICNN running in tandem) achieved the strongest predictive performance. By enforcing physical continuity, the model remains stable even when ground-sensor data is noisy or missing.

Future Directions

The framework establishes a foundation for:

Policy Simulation: Testing how "what-if" scenarios (e.g., a 50% reduction in farm burning) would impact air quality.
Real-Time Warnings: Providing high-resolution maps for public health alerts.
Source Attribution: Offering transparent data to stakeholders regarding industrial vs. agricultural contributions to haze.

‍

Project Advisor(s)

Charnon Pattiyanon

Assistant Director of IT

Research Team member(s)

Chutikarn Kanchanaart

Undergraduate Student

Chavakorn Arunkunarax

Undergraduate Student

Kasidith Saetang

Undergraduate Student