W-WaveNet: A multi-site water quality prediction model incorporating adaptive graph convolution and CNN-LSTM

doi:10.1371/journal.pone.0276155

Fig 1.

Schematic diagram of the fusion model, which integrates a WaveNet network, an LSTM network, and a GCN network.

The data are first processed by convolution in the temporal dimension, then by a spatial convolutional network in the spatial dimension, and finally passed through an LSTM network that captures the sequential (front-to-back) dependencies of the data.


Fig 2.

The geographical position and site distribution of section A.


Table 1.

Data statistics of each water pollution factor in section A.


Table 2.

Data statistics of each water pollution factor in section B.


Fig 3.

Framework of the adaptive graph convolution.

It contains three components: A_α, B_α, and C_α. A_α represents the geographical information of the nodes, B_α is a randomly initialized matrix that adds flexibility to the model, and C_α represents the learnable node embeddings.
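The caption names the three components but not how they are combined, so the following is only a minimal sketch under stated assumptions: A_α is taken to be a row-normalised Gaussian kernel on pairwise site distances, C_α is built from two embedding matrices as softmax(ReLU(E1 E2^T)), and the three matrices are simply summed before one graph convolution step. All function names, shapes, and the synthetic coordinates are hypothetical and are not taken from the paper.

```python
import numpy as np

def row_softmax(x):
    # Numerically stable softmax applied to each row.
    z = x - x.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def geographic_adjacency(dist, sigma):
    # Assumed form of A_alpha: Gaussian kernel on distances, row-normalised.
    w = np.exp(-(dist ** 2) / (sigma ** 2))
    return w / w.sum(axis=1, keepdims=True)

def embedding_adjacency(e1, e2):
    # Assumed form of C_alpha: similarity of node embeddings, row-normalised.
    return row_softmax(np.maximum(e1 @ e2.T, 0.0))

def adaptive_graph_conv(x, a_geo, b_free, c_emb, weight):
    # x has shape (n_sites, in_features). Summing the three matrices is an
    # assumption made for illustration, not the paper's stated rule.
    support = a_geo + b_free + c_emb
    return support @ x @ weight

rng = np.random.default_rng(0)
n_sites, emb_dim, f_in, f_out = 5, 3, 4, 2

# Synthetic site coordinates stand in for real geographic positions.
coords = rng.uniform(0.0, 10.0, size=(n_sites, 2))
dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)

a_geo = geographic_adjacency(dist, sigma=dist.std())
b_free = rng.normal(scale=0.01, size=(n_sites, n_sites))  # B_alpha: random init
c_emb = embedding_adjacency(rng.normal(size=(n_sites, emb_dim)),
                            rng.normal(size=(n_sites, emb_dim)))

x = rng.normal(size=(n_sites, f_in))
weight = rng.normal(size=(f_in, f_out))
out = adaptive_graph_conv(x, a_geo, b_free, c_emb, weight)
print(out.shape)
```

This is a forward pass only. In a trained model, B_α, the embeddings behind C_α, and the weight matrix would be updated by gradient descent, while A_α would stay fixed by geography.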


Fig 4.

Framework of the WaveNet block.

It employs dilated causal convolution as a foundation and includes a gated mechanism and a residual connection.
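The three ingredients named in the caption can be illustrated with a minimal single-channel sketch. It assumes the common WaveNet-style gate, tanh(filter) multiplied by sigmoid(gate), and an additive residual; the kernel values, function names, and input series are hypothetical.

```python
import numpy as np

def causal_dilated_conv(x, kernel, dilation):
    # y[t] = sum_k kernel[k] * x[t - (K - 1 - k) * dilation], with zeros
    # assumed before the start of the series, so y[t] never sees the future.
    k = len(kernel)
    pad = (k - 1) * dilation
    padded = np.concatenate([np.zeros(pad), x])
    out = np.zeros(len(x))
    for i, w in enumerate(kernel):
        out += w * padded[i * dilation : i * dilation + len(x)]
    return out

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def wavenet_block(x, filter_kernel, gate_kernel, dilation):
    # Gated activation: the tanh branch proposes values, the sigmoid branch
    # decides how much of each value passes through.
    f = np.tanh(causal_dilated_conv(x, filter_kernel, dilation))
    g = sigmoid(causal_dilated_conv(x, gate_kernel, dilation))
    # Residual connection: add the block input back to the gated output.
    return x + f * g

# With kernel [1, 1] and dilation 2, each output is x[t - 2] + x[t].
demo = causal_dilated_conv(np.array([1.0, 2.0, 3.0, 4.0, 5.0]),
                           np.array([1.0, 1.0]), dilation=2)
print(demo)  # [1. 2. 4. 6. 8.]

rng = np.random.default_rng(0)
series = rng.normal(size=12)
filter_kernel = np.array([0.5, -0.3])
gate_kernel = np.array([0.2, 0.8])
y = wavenet_block(series, filter_kernel, gate_kernel, dilation=2)

# Causality check: perturbing time step 6 leaves outputs 0..5 unchanged.
perturbed = series.copy()
perturbed[6] += 10.0
y_perturbed = wavenet_block(perturbed, filter_kernel, gate_kernel, dilation=2)
```

Stacking such blocks with growing dilation (1, 2, 4, ...) widens the receptive field exponentially with depth, which is the usual reason for choosing dilated over ordinary convolution.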


Fig 5.

Framework of the CNN-LSTM hybrid model.

It contains multiple convolution layers, whose output is passed through an LSTM and a fully connected layer.


Fig 6.

Schematic representation of spatio-temporal network fusion strategy.

The GCN network immediately follows the CNN to form a spatio-temporal processing module, which is stacked to handle spatial dependencies of different spans, and finally, the LSTM network outputs the results.


Fig 7.

Framework of a single spatio-temporal network (ST-Block).

A spatio-temporal block is created by combining the WaveNet network and the adaptive graph convolution network (AGCN) described previously. A residual connection is added to the AGCN.


Fig 8.

Framework of W-WaveNet.

It combines the skip result of each spatio-temporal block with the stacked output, and finally computes the result through the LSTM network. All spatio-temporal blocks are connected to each other by skip connections.


Fig 9.

Correlation heatmap of multi-site water quality data.

(A) Correlation heatmap among sites. The correlation between sites varies: some pairs of sites reach a correlation of up to 0.75, while others have a correlation of 0. (B) Correlation heatmap between site 4 at time point 7 and every site at different time points. For a given site, the correlation with site 4 at time point 7 reaches its maximum at a particular time point, which is related to the distance between that site and site 4.
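The time-shifted peak described for panel (B) can be illustrated with a lagged Pearson correlation. This is a sketch on synthetic data, not the paper's data or procedure: the two series, the 3-step delay, and the helper name lagged_correlation are all hypothetical, chosen so that a "downstream" site repeats an "upstream" signal after a fixed delay.

```python
import numpy as np

def lagged_correlation(reference, other, max_lag):
    # Pearson correlation between reference[t] and other[t - lag]
    # for lag = 0, 1, ..., max_lag.
    corrs = []
    for lag in range(max_lag + 1):
        ref = reference[lag:]
        oth = other[: len(other) - lag]
        corrs.append(np.corrcoef(ref, oth)[0, 1])
    return np.array(corrs)

rng = np.random.default_rng(1)
true_lag = 3  # assumed travel time between the two synthetic sites
raw = rng.normal(size=500 + true_lag)

# downstream[t] equals upstream[t - true_lag] plus small measurement noise.
upstream = raw[true_lag:]
downstream = raw[:-true_lag] + 0.1 * rng.normal(size=500)

corrs = lagged_correlation(downstream, upstream, max_lag=6)
best_lag = int(np.argmax(corrs))
print(best_lag)
```

Repeating this for every site against one reference site gives one row of such a heatmap per site. Under the caption's interpretation, sites farther from the reference would be expected to peak at larger lags.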


Fig 10.

Average number of correlated sites for each factor in Sections A and B.

This graph shows how the number of correlated sites differs from one factor to another. Factors whose data are only weakly correlated across sites may affect the results.
