Skip to content

Decoder Architecture #76

@zeyadmahmoudd

Description

@zeyadmahmoudd

Thank you for this nice work.

in the decoder you concatenate stages 2, 3 and 4 from the encoder, taking tiny model as an example the number of channels will be 64+160+256=480, while decoder dimension is 256 so we need channel mixing of conv1x1 with input 480 and output 256 channels but that doesnt seem to be the case in your released weights

Image

other thing which matrix decomposition did you adopt in hamburger decoder, also to make sure the head in the decoder is a hamburger head. What are its parameters? and in the final mlp is it only one layer to map to the number of classes or has several hidden layers?

Image

Once again, thank you for this amazing work and for your effort.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions