Hi,
I recently learnt about the selective SSM architecture, and it is awesome!
But I have a question. The Transformer architecture supports sequence parallelism. Does Mamba (a potential alternative to the Transformer) support sequence parallelism as well?