Hi @haohaoxuexi1,
The unconnected-input error occurs because the second input of the 'add' layer ('add/in2') is never connected. When a layer graph is built from a layer array, consecutive layers are chained automatically, so 'pos-emb' already feeds 'add/in1'; connecting the output of the 'input' layer to 'add/in2' with connectLayers completes the residual connection that adds the position embeddings to the input sequence. Here is the updated code:
numChannels = 12;               % number of features per time step
maxPosition = 256;              % maximum sequence length supported by the position embedding
numHeads = 4;                   % number of attention heads
numKeyChannels = numHeads * 32; % total key/query channels across all heads
layers = [
    sequenceInputLayer(numChannels, 'Name', 'input')
    positionEmbeddingLayer(numChannels, maxPosition, 'Name', 'pos-emb')
    additionLayer(2, 'Name', 'add') % sums the input and its position embeddings
    selfAttentionLayer(numHeads, numKeyChannels, 'AttentionMask', 'causal')
    selfAttentionLayer(numHeads, numKeyChannels)
    indexing1dLayer('last') % keeps only the last time step of the sequence
    fullyConnectedLayer(4)
    softmaxLayer
    classificationLayer];
lgraph = layerGraph(layers);
lgraph = connectLayers(lgraph, 'input', 'add/in2'); % route the input to the second input of 'add'
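Before training, you can check that every layer input is now connected. Both of these are standard Deep Learning Toolbox functions:
figure
plot(lgraph) % visualize the layer graph and its connections
analyzeNetwork(lgraph) % reports any remaining connection or size errors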
maxEpochs = 100;
miniBatchSize = 32;
learningRate = 0.001;
solver = 'adam';
shuffle = 'every-epoch';
gradientThreshold = 10;
executionEnvironment = 'auto'; % chooses local GPU if available, otherwise CPU
options = trainingOptions(solver, ...
'Plots', 'training-progress', ...
'MaxEpochs', maxEpochs, ...
'MiniBatchSize', miniBatchSize, ...
'Shuffle', shuffle, ...
'InitialLearnRate', learningRate, ...
'GradientThreshold', gradientThreshold, ...
'ExecutionEnvironment', executionEnvironment);
% Assuming XTrain and YTrain are your training data
net = trainNetwork(XTrain, YTrain, lgraph, options); % Use lgraph instead of layers
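If your data are not yet in the format trainNetwork expects for sequence classification, here is a minimal synthetic sketch of the required shapes (hypothetical data and sizes; the 4 classes match fullyConnectedLayer(4) above). Substitute your real XTrain and YTrain before training:
% XTrain - N-by-1 cell array, each cell a numChannels-by-T matrix
% YTrain - N-by-1 categorical vector of class labels
numObservations = 200; % hypothetical number of training sequences
seqLength = 100;       % hypothetical sequence length (must not exceed maxPosition)
XTrain = arrayfun(@(~) rand(numChannels, seqLength), ...
    (1:numObservations)', 'UniformOutput', false);
YTrain = categorical(randi(4, numObservations, 1));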
Hope this helps resolve your problem.