Hello there! What I meant as components in my comment are like the attention mechanism itself. For reference, here are the mean weights of two models I’m studying.
Hello there! What I meant as components in my comment are like the attention mechanism itself. For reference, here are the mean weights of two models I’m studying.