This repository was archived by the owner on Jan 1, 2025. It is now read-only.
Thank you for your great paper.
I have a question while reading your paper and code.
In the paper, the formulation of divided space-time attention includes the class token in temporal attention, but in the released code the cls_token is excluded from temporal attention and only included in spatial attention.
This question is a duplicate of the following question, but I did not get a satisfactory answer. https://github.com/facebookresearch/TimeSformer/issues/74
Is there a reason for this? Would it be possible to obtain experimental results by including the cls_token during temporal attention?
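For concreteness, here is a minimal numpy sketch (not the authors' code; shapes `B`, `T`, `N`, `D` are assumptions for illustration) of the token partitioning the question refers to: temporal attention sees per-location sequences of length `T` with no cls slot, while spatial attention sees per-frame sequences of length `1 + N` with a repeated cls_token prepended.

```python
import numpy as np

# Hypothetical shapes: B=batch, T=frames, N=patches per frame, D=embed dim.
B, T, N, D = 2, 4, 9, 8
x = np.random.randn(B, 1 + T * N, D)  # [cls_token, then T*N patch tokens]

cls_tok = x[:, :1, :]   # (B, 1, D)
patches = x[:, 1:, :]   # (B, T*N, D)

# Temporal attention: cls_token EXCLUDED (as in the released code).
# Each spatial location attends across the T frames.
xt = patches.reshape(B, T, N, D).transpose(0, 2, 1, 3).reshape(B * N, T, D)
assert xt.shape == (B * N, T, D)  # length-T sequences, no cls slot

# Spatial attention: cls_token INCLUDED.
# Each frame's N patches attend together with a copy of the cls_token.
xs = patches.reshape(B, T, N, D).reshape(B * T, N, D)
cls_rep = np.repeat(cls_tok, T, axis=1).reshape(B * T, 1, D)
xs = np.concatenate([cls_rep, xs], axis=1)
assert xs.shape == (B * T, 1 + N, D)  # length-(1+N) sequences with cls
```

Including the cls_token in temporal attention would presumably require a similar repeat-and-concatenate step on the `(B*N, T, D)` sequences, which is the variant the question asks about.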