TY - GEN
T1 - Comparative Analysis of Transformer and LSTM Networks for Food Production Forecasting
AU - Alkaabi, Nouf
AU - Shakya, Siddhartha
AU - Mizouni, Rabeb
AU - Mio, Corrado
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - For many agricultural authorities, predicting food production demand efficiently is a challenge. Precise production estimates are necessary for a strong supply chain, which is essential to avoid situations of over- or under-production. Demand forecasting methods powered by Machine Learning are capable of making predictions about trends and behaviors based on the collected historical data. Previous research has focused on improving prediction accuracy using the sequential model, namely Long Short-Term Memory (LSTM), and the integration of non-sequential/static features such as temperature, rainy day count, and production month. Two LSTM variations were created, and the prediction of food production was used to compare the two types’ performances. The findings reveal that the incorporation of static features leads to a notable enhancement in the forecasting accuracy of items that are affected by them. Building on previous findings, this paper investigates the topic by applying Transformer encoder models. The choice between Transformers and LSTMs for time series forecasting depends on the nature of the problem and the dataset, the availability of the computational resources, and the complexity of the relationships within the data. Unlike LSTM, Transformers can excel in capturing long-range dependencies and process inputs in parallel due to their attention mechanism. This parallelization can lead to faster training time. Moreover, they can recognize the importance of different elements in the input sequence and ignore irrelevant or noisy data. Our experiments show that the performance of the Transformer models on the food production dataset outperformed the LSTM models, thereby expanding the body of knowledge regarding efficient time series forecasting techniques.
AB - For many agricultural authorities, predicting food production demand efficiently is a challenge. Precise production estimates are necessary for a strong supply chain, which is essential to avoid situations of over- or under-production. Demand forecasting methods powered by Machine Learning are capable of making predictions about trends and behaviors based on the collected historical data. Previous research has focused on improving prediction accuracy using the sequential model, namely Long Short-Term Memory (LSTM), and the integration of non-sequential/static features such as temperature, rainy day count, and production month. Two LSTM variations were created, and the prediction of food production was used to compare the two types’ performances. The findings reveal that the incorporation of static features leads to a notable enhancement in the forecasting accuracy of items that are affected by them. Building on previous findings, this paper investigates the topic by applying Transformer encoder models. The choice between Transformers and LSTMs for time series forecasting depends on the nature of the problem and the dataset, the availability of the computational resources, and the complexity of the relationships within the data. Unlike LSTM, Transformers can excel in capturing long-range dependencies and process inputs in parallel due to their attention mechanism. This parallelization can lead to faster training time. Moreover, they can recognize the importance of different elements in the input sequence and ignore irrelevant or noisy data. Our experiments show that the performance of the Transformer models on the food production dataset outperformed the LSTM models, thereby expanding the body of knowledge regarding efficient time series forecasting techniques.
KW - Encoder
KW - Forecasting
KW - LSTM
KW - Non-sequential features
KW - Sequential features
KW - Supply chain management
KW - Time series analysis
KW - Transformers
UR - https://www.scopus.com/pages/publications/105002044544
U2 - 10.1007/978-3-031-85252-7_9
DO - 10.1007/978-3-031-85252-7_9
M3 - Conference contribution
AN - SCOPUS:105002044544
SN - 9783031852510
T3 - Studies in Computational Intelligence
SP - 149
EP - 164
BT - Computational Intelligence - 14th and 15th International Joint Conference on Computational Intelligence IJCCI 2022 and IJCCI 2023, Revised Selected Papers
A2 - Bäck, Thomas
A2 - van Stein, Niki
A2 - Wagner, Christian
A2 - Garibaldi, Jonathan M.
A2 - Marcelloni, Francesco
A2 - Lam, H.K.
A2 - Cottrell, Marie
A2 - Doctor, Faiyaz
A2 - Filipe, Joaquim
A2 - Warwick, Kevin
A2 - Kacprzyk, Janusz
PB - Springer Science and Business Media Deutschland GmbH
T2 - 14th and 15th International Joint Conference on Computational Intelligence, IJCCI 2022 and IJCCI 2023
Y2 - 13 November 2023 through 15 November 2023
ER -