The MAMBA Model transformer by using a language modeling head on major (linear layer with weights tied for the input
which describes how all The inner states are related because they symbolize the underlying dynamics https://k2spiceshop.com/product/liquid-k2-on-paper-online/