Skip to content

Bug with mask? #4

@dingjiansw101

Description

@dingjiansw101

mask = mask.unsqueeze(1).repeat(1, self.num_heads, 1, 1) # BxNxQ_LENxK_LEN

It seems the mask is not correct. Since there is a permute of query, key, and value. The mask should also has a permute.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions