What is it about?

This study introduces a seasonal modeling approach in the prediction of daily average PM10 (particulate matter with an aerodynamic diameter<10 μm) levels 1 day ahead based on multilayer perceptron artificial neural network (MLP-ANN) forecasters. The data set covered all daily based meteorological parameters and PM10 concentrations in the period of 2007-2014. Seasonal ANN models for winter and summer periods were separately developed and trained by using a lagged time series data set. The most significant lagged terms of the variables within a 1-week period were determined by principal component analysis (PCA) and assigned as input vectors of ANN models. Cascading training with error back-propagation method was applied in model building. The use of seasonal ANN models with PCA-based inputs showed an increased prediction performance compared with nonseasonal models. For seasonal ANN models, the overall model agreement in training between modeled and observed values varied in the range of 0.78-0.83 and R2 values ranged in 0.681-0.727, which outperformed nonseasonal models. The best testing R2 values of seasonal models for winter and summer periods ranged in 0.709-0.727 with lower testing error, and the models did not show a tendency towards overpredicting or underpredicting the PM10 levels. The approach demonstrated in the study appeared to be promising for predicting short-term levels of pollutants through the data sets with high irregularities and could have significant applicability in the case of large number of considered inputs.

Featured Image

Why is it important?

This study attempts at designing individual seasonal ANN models for reasonable predictions of daily average PM10 concentrations 1 day ahead by using time series data, taking into account the effects from local meteorological conditions. A lagged time series data set was produced by back-shifting up to 7 days to select significant lags of variables as input vectors to the ANN models. Furthermore, PCA method was employed before constructing ANN models in determining of proper consecutive inputs from lagged variables in the data set.

Perspectives

The main goal of this study was to improve ANN model predictions of PM10 levels 1 day ahead by applying PCA method in the selection of the most significant lagged terms of the variables as inputs, developing seasonal ANN models for winter and summer periods, and comparing singular ANN models as benchmark. The most significant lagged inputs were three consecutive lags of PM10 and AT according to PCA runs. In training of ANN models, cascading-training procedure provided by FANN library was employed, which produced reasonably successful models. It may thus be an alternative way to determine the right number of hidden units in the middle layer of ANNs. For seasonal ANN models, the overall model agreement in training between modeled and observed values varied in the range of 0.78–0.83 and R2 values ranged in 0.681–0.727. The best testing R2 values of seasonal models for winter and summer period models ranged in 0.709–0.711, with lower testing RMSE values comparing with the nonseasonal models. Also, seasonal models did not show a tendency towards overpredicting or underpredicting the daily average PM10 levels 1 day ahead with better estimates. This approach appeared to be promising in capturing nonlinear features in data and could have significant applicability in the case of the data sets with a large number of considered inputs.

Dr Fatih Taşpınar
Duzce Universitesi

Read the Original

This page is a summary of: Improving artificial neural network model predictions of daily average PM10 concentrations by applying principle component analysis and implementing seasonal models, Journal of the Air & Waste Management Association, February 2015, Taylor & Francis,
DOI: 10.1080/10962247.2015.1019652.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page