The 2-Minute Rule for the Mamba Paper
We modified Mamba's internal equations to accept inputs from, and combine, two separate data streams. To the best of our knowledge, this is the first attempt to adapt the equations of SSMs to a vision task such as style transfer without requiring any other module like cross-attention or custom normalization layers.
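The text does not say how the two streams enter the equations, so the following is only a minimal sketch of one possible arrangement, not the authors' formulation. It assumes a discretized selective SSM recurrence in which the step size and input matrix are computed from a style stream, the style tokens drive the hidden state, and the output matrix is computed from a content stream. The function name `two_stream_ssm`, the projection matrices `W_delta`, `W_B`, `W_C`, and the choice of which stream feeds which parameter are all hypothetical, and both streams are assumed to have the same sequence length.

```python
import numpy as np

def softplus(x):
    # Numerically stable softplus: log(1 + exp(x)).
    return np.logaddexp(0.0, x)

def two_stream_ssm(content, style, A, W_delta, W_B, W_C):
    """Hypothetical two-stream selective SSM scan (illustrative only).

    content, style: (T, D) token sequences of equal length (assumption).
    A:              (D, N) state matrix with negative entries.
    W_delta:        (D, D) projection producing the step size from style.
    W_B, W_C:       (D, N) projections producing B from style, C from content.
    Returns y of shape (T, D).
    """
    T, D = content.shape
    N = A.shape[1]
    h = np.zeros((D, N))               # hidden state, one row per channel
    y = np.empty((T, D))
    for t in range(T):
        delta = softplus(style[t] @ W_delta)   # (D,) positive step size
        B = style[t] @ W_B                     # (N,) input matrix from style
        C = content[t] @ W_C                   # (N,) output matrix from content
        A_bar = np.exp(delta[:, None] * A)     # (D, N) discretized A, in (0, 1)
        B_bar = delta[:, None] * B[None, :]    # (D, N) simplified discretized B
        h = A_bar * h + B_bar * style[t][:, None]  # state driven by style
        y[t] = h @ C                           # read out with content-derived C
    return y

# Small demo with random data (all sizes are arbitrary).
rng = np.random.default_rng(0)
T, D, N = 6, 4, 3
content = rng.standard_normal((T, D))
style = rng.standard_normal((T, D))
A = -np.exp(rng.standard_normal((D, N)))       # negative entries keep the scan stable
W_delta = 0.1 * rng.standard_normal((D, D))
W_B = 0.1 * rng.standard_normal((D, N))
W_C = 0.1 * rng.standard_normal((D, N))
y = two_stream_ssm(content, style, A, W_delta, W_B, W_C)
```

In this sketch the two streams are combined inside the recurrence itself: the state accumulates style information and the content decides how that state is read out, so no separate cross-attention or normalization module appears. Other assignments of streams to parameters are equally plausible under the same description.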