Skip to content

Question on xr and qr kr. #1

@Capchenxi

Description

@Capchenxi

Hello,

Thanks for your excellent work! I'm reading this paper and having one question on x^r and q_sa^r and k_sa^r. I'm wondering what is the specific relationship between q_sa^r ,k_sa^r and x^r? Since from the paper I learnt that q_sa^r and k_sa^r are calculated form the downsampled feature x_sa. And x_sa is basically has not impact on both x^t and x^r which means it is the same to x^t as well as x^r. I guess I didn't get the part of how you get q_sa^r ,k_sa^r from x^r. Would you please give a more detailed explanation or formula/figure on this? Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions