Skip to main navigation Skip to search Skip to main content

Comparative Analysis of Transformer and LSTM Networks for Food Production Forecasting

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

For many agricultural authorities, predicting food production demand efficiently is a challenge. Precise production estimates are necessary for a strong supply chain, which is essential to avoid situations of over- or under-production. Demand forecasting methods powered by Machine Learning are capable of making predictions about trends and behaviors based on the collected historical data. Previous research has focused on improving prediction accuracy using the sequential model, namely Long Short-Term Memory (LSTM), and the integration of non-sequential/static features such as temperature, rainy day count, and production month. Two LSTM variations were created, and the prediction of food production was used to compare the two types’ performances. The findings reveal that the incorporation of static features leads to a notable enhancement in the forecasting accuracy of items that are affected by them. Building on previous findings, this paper investigates the topic by applying Transformer encoder models. The choice between Transformers and LSTMs for time series forecasting depends on the nature of the problem and the dataset, the availability of the computational resources, and the complexity of the relationships within the data. Unlike LSTM, Transformers can excel in capturing long-range dependencies and process inputs in parallel due to their attention mechanism. This parallelization can lead to faster training time. Moreover, they can recognize the importance of different elements in the input sequence and ignore irrelevant or noisy data. Our experiments show that the performance of the Transformer models on the food production dataset outperformed the LSTM models, thereby expanding the body of knowledge regarding efficient time series forecasting techniques.

Original languageBritish English
Title of host publicationComputational Intelligence - 14th and 15th International Joint Conference on Computational Intelligence IJCCI 2022 and IJCCI 2023, Revised Selected Papers
EditorsThomas Bäck, Niki van Stein, Christian Wagner, Jonathan M. Garibaldi, Francesco Marcelloni, H.K. Lam, Marie Cottrell, Faiyaz Doctor, Joaquim Filipe, Kevin Warwick, Janusz Kacprzyk
PublisherSpringer Science and Business Media Deutschland GmbH
Pages149-164
Number of pages16
ISBN (Print)9783031852510
DOIs
StatePublished - 2025
Event14th and 15th International Joint Conference on Computational Intelligence, IJCCI 2022 and IJCCI 2023 - Rome, Italy
Duration: 13 Nov 202315 Nov 2023

Publication series

NameStudies in Computational Intelligence
Volume1196 SCI
ISSN (Print)1860-949X
ISSN (Electronic)1860-9503

Conference

Conference14th and 15th International Joint Conference on Computational Intelligence, IJCCI 2022 and IJCCI 2023
Country/TerritoryItaly
CityRome
Period13/11/2315/11/23

Keywords

  • Encoder
  • Forecasting
  • LSTM
  • Non-sequential features
  • Sequential features
  • Supply chain management
  • Time series analysis
  • Transformers

Fingerprint

Dive into the research topics of 'Comparative Analysis of Transformer and LSTM Networks for Food Production Forecasting'. Together they form a unique fingerprint.

Cite this