LSTM for time series forecasting

23 Ansichten (letzte 30 Tage)
Nestoras Papadopoulos
Nestoras Papadopoulos am 30 Nov. 2023
Hello,
I am trying to solve a time series forecasting problem and as I am new to deep learning I followed the chickepox example which works well.
For better prediction I want to use 3 more parallel time series data which affect my other time series for prediction. I tried to follow the "time series forecasting using deep learning" example but the training data there, are a part of the time series number. What's the best strategy for handling the above data? Is there an example like mine?
  2 Kommentare
Debraj Maji
Debraj Maji am 25 Dez. 2023
Can you please provide the data you are using?
Nestoras Papadopoulos
Nestoras Papadopoulos am 5 Jan. 2024
My data are in the attached matfile.
time is the hourly times corrspponding to the data.
data holds four timeseries. I want to make prediction for the fourth line. Each cell corresponds to a year.
The fourth timeseries is depended on the three others.
Note: I don't know if it helps, but timeseries in lines 1 and 3 are functions of time. This means that values of them can be calculated in future.
Thanks for your help.

Melden Sie sich an, um zu kommentieren.

Akzeptierte Antwort

Shubham
Shubham am 28 Dez. 2023
Hi Nestoras,
When you're dealing with multiple parallel time series that affect each other, you're looking at a multivariate time series forecasting problem. Deep learning models, particularly Recurrent Neural Networks (RNNs) like Long Short-Term Memory (LSTM) networks, are well-suited for this kind of task.
The general strategy for handling multivariate time series data is to format your data such that each time step includes feature vectors that incorporate all the relevant variables. For example, if you have four parallel time series (including the original one you're trying to predict), each time step in your input data will include a vector with four elements.
Here are some steps to help you adapt the chickenpox example to your multivariate scenario:
  1. Format your data: As shown in the code snippet above, combine your time series into a single dataset where each time step includes all relevant variables.
  2. Define the architecture of your LSTM network: Depending on the complexity of your problem, you might need to adjust the number of LSTM layers and the number of units in each layer.
  3. Train the LSTM network: Use the formatted data to train the network. Make sure to split your data into training and validation sets to monitor for overfitting.
  4. Evaluate the model: After training, evaluate your model on a test set to see how well it generalizes to unseen data.
  5. Forecast future values: Use the trained model to forecast future time steps. For multivariate forecasting, you will need to provide values for all input variables, not just the historical values of the series you're trying to predict.
Remember to normalize your data if necessary, as LSTM networks are sensitive to the scale of the input data. Also, consider experimenting with different architectures, hyperparameters, and data preprocessing techniques to improve performance.
  2 Kommentare
Nestoras Papadopoulos
Nestoras Papadopoulos am 5 Jan. 2024
Nice answer and I have already followed it. Just a couple of more questions:
How I define the appropriate hidden units? I tried some numbers (200-500-1000-2000) and the more units I use the RMSE falls (below 0.1 using 2000) but the processing time increases too much. When is it overfitted?
I have gaps in my data. I know that they shouldn't exist in LSTM, so I started to cut the sequnence just before the gap. Another strategy I tried was to make smaller sequences without gaps containing 200-400 time steps but the sequences weren't of the same length. What is the best option?
Thank you very much.
Nestoras Papadopoulos
Nestoras Papadopoulos am 5 Jan. 2024
Data submitted in previous comment

Melden Sie sich an, um zu kommentieren.

Weitere Antworten (0)

Kategorien

Mehr zu Sequence and Numeric Feature Data Workflows finden Sie in Help Center und File Exchange

Produkte


Version

R2023b

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by