lstmProjectedLayer

Long short-term memory (LSTM) projected layer for recurrent neural network (RNN)

Since R2022b

Description

An LSTM projected layer is an RNN layer that learns long-term dependencies between time steps in time-series and sequence data using projected learnable weights.

To compress a deep learning network, you can use projected layers. A projected layer is a type of deep learning layer that enables compression by reducing the number of stored learnable parameters. The layer introduces learnable projector matrices Q, replaces multiplications of the form $W x$, where W is a learnable matrix, with the multiplication $W Q Q^{⊤} x$, and stores Q and $W' = W Q$ instead of storing W. Projecting x into a lower-dimensional space using Q typically requires less memory to store the learnable parameters and can have similarly strong prediction accuracy.

Reducing the number of learnable parameters by projecting an LSTM layer rather than reducing the number of hidden units of the LSTM layer maintains the output size of the layer and, in turn, the sizes of the downstream layers, which can result in better prediction accuracy.
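The projection described above can be sketched with plain matrices. This is a minimal illustration, not the layer's internal implementation: the sizes are made up, and the projector Q is given orthonormal columns using orth purely as an assumption for the example.

```matlab
% Illustrative sizes (assumptions, not values from this page).
inputSize = 64;      % length of x
outputSize = 400;    % number of rows of W
projectorSize = 16;  % number of columns of Q

rng(0)                                    % make the random values repeatable
W = randn(outputSize,inputSize);          % original learnable matrix
Q = orth(randn(inputSize,projectorSize)); % projector (orthonormal columns assumed here)
x = randn(inputSize,1);

Wprime = W*Q;        % stored in place of W
y = Wprime*(Q'*x);   % equals W*Q*Q'*x, computed in the projected space

numParamsOriginal = numel(W)                    % 400*64 = 25600
numParamsProjected = numel(Wprime) + numel(Q)   % (400+64)*16 = 7424
```

Storing Wprime and Q instead of W reduces the parameter count only when projectorSize is small enough relative to the dimensions of W.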

Creation

Syntax

layer = lstmProjectedLayer(numHiddenUnits,outputProjectorSize,inputProjectorSize)

layer = lstmProjectedLayer(___,Name=Value)

Description

layer = lstmProjectedLayer(numHiddenUnits,outputProjectorSize,inputProjectorSize) creates an LSTM projected layer and sets the NumHiddenUnits, OutputProjectorSize, and InputProjectorSize properties.


layer = lstmProjectedLayer(___,Name=Value) sets the OutputMode, HasStateInputs, HasStateOutputs, Activations, State, Parameters and Initialization, Learning Rate and Regularization, and Name properties using one or more name-value arguments.
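As a short sketch of both syntaxes, the following code creates a named layer and then places a second projected layer in a small layer array. The sizes (100 hidden units, projector sizes 30 and 16, 64 input channels, 9 classes) are illustrative assumptions and are not prescribed by this page.

```matlab
% Create a single projected LSTM layer and name it.
layer = lstmProjectedLayer(100,30,16,Name="lstmp");

% Use a projected LSTM layer inside a layer array.
inputSize = 64;   % assumed number of input channels
numClasses = 9;   % assumed number of classes
layers = [
    sequenceInputLayer(inputSize)
    lstmProjectedLayer(100,30,16,OutputMode="last")
    fullyConnectedLayer(numClasses)
    softmaxLayer];
```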

Tip

To compress a neural network using projection, use the compressNetworkUsingProjection function.


Properties


Projected LSTM

`NumHiddenUnits` — Number of hidden units
positive integer

Number of hidden units (also known as the hidden size), specified as a positive integer.

The number of hidden units corresponds to the amount of information that the layer remembers between time steps (the hidden state). The hidden state can contain information from all the previous time steps, regardless of the sequence length. If the number of hidden units is too large, then the layer can overfit to the training data. The hidden state does not limit the number of time steps that the layer processes in an iteration.

The layer outputs data with NumHiddenUnits channels.

To set this property, use the numHiddenUnits argument when you create the LSTMProjectedLayer object. After you create an LSTMProjectedLayer object, this property is read-only.

`OutputProjectorSize` — Output projector size
positive integer

Output projector size, specified as a positive integer.

The LSTM layer operation uses four matrix multiplications of the form $R h_{t - 1}$, where R denotes the recurrent weights and h_t denotes the hidden state (or, equivalently, the layer output) at time step t.

The LSTM projected layer operation instead uses multiplications of the form $R Q_{o} Q_{o}^{⊤} h_{t - 1}$, where Q_o is a NumHiddenUnits-by-OutputProjectorSize matrix known as the output projector. The layer uses the same projector Q_o for each of the four multiplications.

To perform the four multiplications of the form $R h_{t - 1}$, an LSTM layer stores four recurrent weight matrices R, which requires storing 4*NumHiddenUnits^2 learnable parameters. By instead storing the 4*NumHiddenUnits-by-OutputProjectorSize matrix $R' = R Q_{o}$ and Q_o, an LSTM projected layer can perform the multiplication $R Q_{o} Q_{o}^{⊤} h_{t - 1}$ and store only 5*NumHiddenUnits*OutputProjectorSize learnable parameters.

To set this property, use the outputProjectorSize argument when you create the LSTMProjectedLayer object. After you create an LSTMProjectedLayer object, this property is read-only.

Tip

To ensure that $R Q_{o} Q_{o}^{⊤} h_{t - 1}$ requires fewer learnable parameters, set the OutputProjectorSize property to a value less than 4*NumHiddenUnits/5.
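To make the recurrent-weight counts concrete, this sketch evaluates the formulas above for one hypothetical configuration (100 hidden units and an output projector size of 30, both chosen only for illustration).

```matlab
numHiddenUnits = 100;       % illustrative value
outputProjectorSize = 30;   % illustrative value

% Recurrent parameters stored by an LSTM layer without projection.
numRecurrentParams = 4*numHiddenUnits^2                     % 40000

% Parameters stored for R' = R*Q_o together with Q_o.
numProjectedParams = 5*numHiddenUnits*outputProjectorSize   % 15000

% Projection reduces the count only below this projector size.
maxUsefulSize = 4*numHiddenUnits/5                          % 80
```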

`InputProjectorSize` — Input projector size
positive integer

Input projector size, specified as a positive integer.

The LSTM layer operation uses four matrix multiplications of the form $W x_{t}$, where W denotes the input weights and x_t denotes the layer input at time step t.

The LSTM projected layer operation instead uses multiplications of the form $W Q_{i} Q_{i}^{⊤} x_{t}$, where Q_i is an InputSize-by-InputProjectorSize matrix known as the input projector. The layer uses the same projector Q_i for each of the four multiplications.

To perform the four multiplications of the form $W x_{t}$, an LSTM layer stores four weight matrices W, which requires storing 4*NumHiddenUnits*InputSize learnable parameters. By instead storing the 4*NumHiddenUnits-by-InputProjectorSize matrix $W' = W Q_{i}$ and Q_i, an LSTM projected layer can perform the multiplication $W Q_{i} Q_{i}^{⊤} x_{t}$ and store only (4*NumHiddenUnits+InputSize)*InputProjectorSize learnable parameters.

To set this property, use the inputProjectorSize argument when you create the LSTMProjectedLayer object. After you create an LSTMProjectedLayer object, this property is read-only.

Tip

To ensure that $W Q_{i} Q_{i}^{⊤} x_{t}$ requires fewer learnable parameters, set the InputProjectorSize property to a value less than 4*NumHiddenUnits*InputSize/(4*NumHiddenUnits+InputSize).
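The same arithmetic applies to the input weights. The values here (100 hidden units, an input size of 64, and an input projector size of 16) are assumptions picked for illustration. When the input size is small, the threshold is small as well, so an input projector does not always reduce the parameter count.

```matlab
numHiddenUnits = 100;      % illustrative value
inputSize = 64;            % illustrative value
inputProjectorSize = 16;   % illustrative value

% Input-weight parameters stored by an LSTM layer without projection.
numInputParams = 4*numHiddenUnits*inputSize                            % 25600

% Parameters stored for W' = W*Q_i together with Q_i.
numProjectedParams = (4*numHiddenUnits + inputSize)*inputProjectorSize % 7424

% Projection reduces the count only below this projector size (about 55.2).
maxUsefulSize = 4*numHiddenUnits*inputSize/(4*numHiddenUnits + inputSize)
```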

`OutputMode` — Output mode
`"sequence"` (default) | `"last"`

Output mode, specified as one of these values:

"sequence" — Output the complete sequence.
"last" — Output the last time step of the sequence.

The LSTMProjectedLayer object stores this property as a character vector.

To set this property, use the corresponding name-value argument when you create the LSTMProjectedLayer object. After you create an LSTMProjectedLayer object, this property is read-only.
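For example, this sketch requests only the last time step as output and then inspects how the value is stored. The sizes are illustrative assumptions.

```matlab
% Output only the last time step of each sequence.
layer = lstmProjectedLayer(100,30,16,OutputMode="last");

% The option is passed as a string here; the stored value can be inspected.
mode = layer.OutputMode
modeClass = class(layer.OutputMode)
```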

`HasStateInputs` — Flag for state inputs to layer
`0` (`false`) (default) | `1` (`true`)

This property is read-only.

Flag for state inputs to the layer, specified as 0 (false) or 1 (true).

If the HasStateInputs property is 0 (false), then the layer has one input with the name "in", which corresponds to the input data. In this case, the layer uses the HiddenState and CellState properties for the layer operation.

If the HasStateInputs property is 1 (true), then the layer has three inputs with the names "in", "hidden", and "cell", which correspond to the input data, hidden state, and cell state, respectively. In this case, the layer uses the values passed to these inputs for the layer operation. If HasStateInputs is 1 (true), then the HiddenState and CellState properties must be empty.

`HasStateOutputs` — Flag for state outputs from layer
`0` (`false`) (default) | `1` (`true`)

This property is read-only.

Flag for state outputs from the layer, specified as 0 (false) or 1 (true).

If the HasStateOutputs property is 0 (false), then the layer has one output with the name "out", which corresponds to the output data.

If the HasStateOutputs property is 1 (true), then the layer has three outputs with the names "out", "hidden", and "cell", which correspond to the output data, hidden state, and cell state, respectively. In this case, the layer also outputs the state values that it computes.
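As a sketch of both flags together, the following code enables state inputs and state outputs at creation and then lists the resulting input and output names. It assumes that the layer exposes InputNames and OutputNames properties, as other Deep Learning Toolbox layers do; the sizes are illustrative.

```matlab
% Enable state inputs and state outputs when creating the layer.
layer = lstmProjectedLayer(100,30,16, ...
    HasStateInputs=true, ...
    HasStateOutputs=true);

% With both flags set, the layer has three inputs and three outputs.
inputNames = layer.InputNames
outputNames = layer.OutputNames
```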

`InputSize` — Input size
`"auto"` (default) | positive integer

This property is read-only.

Input size, specified as a positive integer or "auto". If InputSize is "auto", then the software automatically assigns the input size at training time.

If InputSize is "auto", then the LSTMProjectedLayer object stores this property as a character vector.

Data Types: double | char | string

Activations

`StateActivationFunction` — Activation function to update cell and hidden state
`"tanh"` (default) | `"softsign"` | `"relu"` (since R2024a)

This property is read-only.

Activation function to update the cell and hidden state, specified as one of these values:

"tanh" — Use the hyperbolic tangent function (tanh).
"softsign" — Use the softsign function $\operatorname{softsign}(x) = \frac{x}{1 + | x |}$.
"relu" (since R2024a) — Use the rectified linear unit (ReLU) function $\operatorname{ReLU}(x) = \begin{cases} x, & x > 0 \\ 0, & x \leq 0 \end{cases}$.

The software uses this option as the function $σ_{c}$ in the calculations to update the cell and hidden state.

For more information on how an LSTM layer uses activation functions, see Long Short-Term Memory Layer.

The LSTMProjectedLayer object stores this property as a character vector.
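The three state activation options can be compared numerically with anonymous functions. This is only a sketch of the formulas listed above, not the code that the layer runs internally; the function handles softsign and relu are names introduced for this illustration.

```matlab
% Elementwise definitions matching the formulas above.
softsign = @(x) x./(1 + abs(x));
relu = @(x) max(x,0);

x = [-2 0 2];
yTanh = tanh(x)          % built-in hyperbolic tangent
ySoftsign = softsign(x)  % [-2/3 0 2/3]
yRelu = relu(x)          % [0 0 2]
```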

`GateActivationFunction` — Activation function to apply to gates
`"sigmoid"` (default) | `"hard-sigmoid"`

Activation function to apply to the gates, specified as one of these values:

"sigmoid" — Use the sigmoid function, $σ (x) = {(1 + e^{- x})}^{- 1}$.
"hard-sigmoid" — Use the hard sigmoid function,

$σ (x) = \begin{cases} 0 & \text{if } x < - 2.5 \\ 0.2 x + 0.5 & \text{if } - 2.5 \leq x \leq 2.5 \\ 1 & \text{if } x > 2.5 \end{cases}$

The software uses this option as the function $σ_{g}$ in the calculations for the layer gates.

The LSTMProjectedLayer object stores this property as a character vector.

To set this property, use the corresponding name-value argument when you create the LSTMProjectedLayer object. After you create an LSTMProjectedLayer object, this property is read-only.
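The two gate activation options can be written as anonymous functions to see how the hard sigmoid approximates the sigmoid with a clipped line. This is a sketch of the formulas above under the names sigmoid and hardSigmoid, which are introduced here for illustration only.

```matlab
% Elementwise definitions matching the formulas above.
sigmoid = @(x) 1./(1 + exp(-x));
hardSigmoid = @(x) min(max(0.2*x + 0.5,0),1);  % clip the line 0.2x+0.5 to [0,1]

x = [-3 -2.5 0 2.5 3];
ySigmoid = sigmoid(x)
yHard = hardSigmoid(x)   % [0 0 0.5 1 1]
```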

State

`CellState` — Cell state
`[]` (default) | numeric vector

Cell state to use in the layer operation, specified as a NumHiddenUnits-by-1 numeric vector. This value corresponds to the initial cell state when data is passed to the layer.

After you set this property manually, calls to the resetState function set the cell state to this value.

If HasStateInputs is 1 (true), then the CellState property must be empty.

Data Types: single | double

`HiddenState` — Hidden state
`[]` (default) | numeric vector

Hidden state to use in the layer operation, specified as a NumHiddenUnits-by-1 numeric vector. This value corresponds to the initial hidden state when data is passed to the layer.

After you set this property manually, calls to the resetState function set the hidden state to this value.

If HasStateInputs is 1 (true), then the HiddenState property must be empty.

Data Types: single | double
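As a minimal sketch, the following code sets initial hidden and cell states on a layer that does not use state inputs. It assumes that both properties can be assigned directly on the layer object after creation; the zero values and the sizes are illustrative.

```matlab
numHiddenUnits = 100;   % illustrative value
layer = lstmProjectedLayer(numHiddenUnits,30,16);

% Each state is a NumHiddenUnits-by-1 numeric vector.
layer.HiddenState = zeros(numHiddenUnits,1);
layer.CellState = zeros(numHiddenUnits,1);
```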

Parameters and Initialization