近年来,基于深度学习方法(尤其是递归神经网络(RNN))的短期流量预测受到了广泛关注。但是,就时空数据的预测能力和处理丢失数据的能力而言,基于RNN的模型在交通预测中的潜力尚未得到充分利用。在本文中,我们将重点放在基于RNN的模型上,并尝试重新构建将RNN及其变体合并到流量预测模型中的方式。提出了一种堆叠的双向和单向LSTM网络体系结构(SBU-LSTM),以协助设计用于交通状态预测的神经网络结构。作为体系结构的关键组件,双向LSTM(BDLSM)被用来捕获时空数据中的前向和后向时间依赖性。为了处理时空数据中的缺失值,我们还设计了一种插补单元以推断缺失值并协助流量预测,从而在LSTM结构(LSTM-1)中提出了一种数据插补机制。LSTM-1的双向版本已包含在SBU-LSTM体系结构中。两个真实世界范围内的流量状态数据集用于进行实验并发布,以促进进一步的流量预测研究。评估了多种类型的多层LSTM或BDLSTM模型的预测性能。实验结果表明,所提出的SBU-LSTM体系结构,尤其是两层BDLSTM网络,可以在准确性和鲁棒性方面实现全网流量预测的卓越性能。进一步,
Short-term traffic forecasting based on deep learning methods, especially recurrent neural networks (RNN), has received much attention in recent years. However, the potential of RNN-based models in traffic forecasting has not yet been fully exploited in terms of the predictive power of spatial–temporal data and the capability of handling missing data. In this paper, we focus on RNN-based models and attempt to reformulate the way to incorporate RNN and its variants into traffic prediction models. A stacked bidirectional and unidirectional LSTM network architecture (SBU-LSTM) is proposed to assist the design of neural network structures for traffic state forecasting. As a key component of the architecture, the bidirectional LSTM (BDLSM) is exploited to capture the forward and backward temporal dependencies in spatiotemporal data. To deal with missing values in spatial–temporal data, we also propose a data imputation mechanism in the LSTM structure (LSTM-I) by designing an imputation unit to infer missing values and assist traffic prediction. The bidirectional version of LSTM-I is incorporated in the SBU-LSTM architecture. Two real-world network-wide traffic state datasets are used to conduct experiments and published to facilitate further traffic prediction research. The prediction performance of multiple types of multi-layer LSTM or BDLSTM models is evaluated. Experimental results indicate that the proposed SBU-LSTM architecture, especially the two-layer BDLSTM network, can achieve superior performance for the network-wide traffic prediction in both accuracy and robustness. Further, comprehensive comparison results show that the proposed data imputation mechanism in the RNN-based models can achieve outstanding prediction performance when the model’s input data contains different patterns of missing values.