Work

Takeaway

  • Train Mamba: https://www.youtube.com/watch?v=qUfZruIKwtc&ab_channel=Oxen

  • https://www.youtube.com/watch?v=qUfZruIKwtc&ab_channel=Oxen (for RAG)

https://www.youtube.com/@oxen-ai/videos (Oxen is an excellent programmer)

Reference

Hepta. “How to Judge RWKV (arXiv 2305.13048)?,” September 15, 2023. https://www.zhihu.com/question/602564718/answer/3211669817.

[Efficiently Modeling Long Sequences with Structured State Spaces - Albert Gu Stanford MLSys #46 (youtube.com)](https://www.youtube.com/watch?v=EvQ3ncuriCM)

Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained) (youtube.com)

https://www.youtube.com/watch?v=8Q_tqwpTpVU&ab_channel=UmarJamil

https://www.youtube.com/watch?v=iskuX3Ak9Uk&ab_channel=TrelisResearch (good)