Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

Question

John Smith am 13 Mär. 2023

0
Verknüpfen

Direkter Link zu dieser Frage

https://de.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension

Kommentiert: Artem Lensky am 19 Aug. 2023

Hello,

While implementing a ViT transformer in Matlab, I found at that the concatLayer does not concatenate over the T dimension. This is needed to concatenate the class token with patch tokens, since the natural representation is CBT with C corresponding to features, B to batch and T to token within a batch (this is also the canonical representation in the attention function).

It's possible to work around this by hacking to e.g. SCB, but then other problems pop up which also need to be hacked around.

Thx

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Melden Sie sich an, um zu kommentieren.

Melden Sie sich an, um diese Frage zu beantworten.

Answer 1

Ben am 14 Mär. 2023

1
Verknüpfen

Direkter Link zu dieser Antwort

https://de.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension#answer_1192820

You can create a layer that concatenates on the T dimension with functionLayer

sequenceCatLayer = functionLayer(@(x,y) cat(3,x,y));

This will work in dlnetwork to concatenate two CBT dlarray-s.

Since you're concatenating the class token, it might also be worth considering creating a custom layer that has the class token embedding as a Learnable property, and performs the concatenation in the predict method.

3 Kommentare
1 älteren Kommentar anzeigen1 älteren Kommentar ausblenden

Catalytic am 23 Mär. 2023

Bearbeitet: Catalytic am 23 Mär. 2023

@John Smith - Since Ben's answer yielded a solution for you, you should hit the Accept this Answer button, and likewise with other answers you might not have accepted.

Artem Lensky am 19 Aug. 2023

Are there any plans to make concatenationLayer support concatetnation along the T dimension?

Melden Sie sich an, um zu kommentieren.

Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Akzeptierte Antwort

3 Kommentare
1 älteren Kommentar anzeigen1 älteren Kommentar ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

0 Kommentare -2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

Akzeptierte Antwort

3 Kommentare 1 älteren Kommentar anzeigen1 älteren Kommentar ausblenden

Weitere Antworten (0)

Siehe auch

Kategorien

Tags

Produkte

Version

Community Treasure Hunt

0 Kommentare
-2 ältere Kommentare anzeigen-2 ältere Kommentare ausblenden

3 Kommentare
1 älteren Kommentar anzeigen1 älteren Kommentar ausblenden