-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
Hello,
Thanks for your excellent work! I'm reading this paper and having one question on x^r and q_sa^r and k_sa^r. I'm wondering what is the specific relationship between q_sa^r ,k_sa^r and x^r? Since from the paper I learnt that q_sa^r and k_sa^r are calculated form the downsampled feature x_sa. And x_sa is basically has not impact on both x^t and x^r which means it is the same to x^t as well as x^r. I guess I didn't get the part of how you get q_sa^r ,k_sa^r from x^r. Would you please give a more detailed explanation or formula/figure on this? Thanks!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels