Mamba inference implementation.
See “Mamba: Linear-Time Sequence Modeling with Selective State Spaces”
Based on the reference implementation from the AlbertMamba project.
A fast implementation of Mamba for inference only, based on Laurent Mazare’s Rust implementation: mamba.rs
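
For readers unfamiliar with why an inference-only implementation can be fast, the following is a minimal sketch of one recurrent step of a diagonal selective state-space model, as described in the paper cited above. It is an illustration under stated assumptions, not this module's code: the function name `ssm_step`, its parameters, and the single-channel layout are all hypothetical, and a real implementation works on batched tensors rather than slices.

```rust
/// One recurrent step of a diagonal selective SSM for a single channel.
/// Hypothetical helper for illustration only; not part of this module's API.
///
/// `h` is the hidden state (length N), `a` the diagonal of A, `b` and `c`
/// the input-dependent B and C vectors for this time step, `d` the skip
/// weight, `delta` the input-dependent step size, and `x` the scalar input.
fn ssm_step(
    h: &mut [f32],
    a: &[f32],
    b: &[f32],
    c: &[f32],
    d: f32,
    delta: f32,
    x: f32,
) -> f32 {
    assert!(a.len() == h.len() && b.len() == h.len() && c.len() == h.len());
    // Skip connection: y starts as D * x.
    let mut y = d * x;
    for i in 0..h.len() {
        // Discretize: A_bar = exp(delta * A), B_bar = delta * B
        // (the simplified discretization of B is an assumption of this sketch).
        let a_bar = (delta * a[i]).exp();
        let b_bar = delta * b[i];
        // State update: h_t = A_bar * h_{t-1} + B_bar * x_t.
        h[i] = a_bar * h[i] + b_bar * x;
        // Readout: y_t = C . h_t + D * x_t.
        y += c[i] * h[i];
    }
    y
}
```

Because each step only reads and updates the fixed-size state `h`, the cost per generated token in this sketch does not grow with the length of the sequence seen so far.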