Part 11 (10 points, non-coding task)
So far, we have proved that GQA can always be represented by MLA.
In this part, you are asked to prove that GQA is not equivalent to MLA. What you need to do is to find one example that MLA cannot be represented as GQA.
To be specific, please do the following things:
-
Construct \color{red}{\mathbf{W}^{\mathbf{DKV}, MLA}} \in \Bbb R^{1 \times 2}.
-
Construct \color{blue}{\mathbf{W}^{\mathbf{UM}, MLA}} \in \Bbb R^{2 \times 1}.
-
Do matrix multiplication \color{blue}{\mathbf{W}^{\mathbf{UM}, MLA}} \color{red}{\mathbf{W}^{\mathbf{DKV}, MLA}}.
-
Show that this product matrix is not the concatenation of two copies of 1-by-2 matrices along axis 0.