Takeaway
-
Train Mamba: https://www.youtube.com/watch?v=qUfZruIKwtc&ab_channel=Oxen
-
https://www.youtube.com/watch?v=qUfZruIKwtc&ab_channel=Oxen. –> for RAG
-
https://www.youtube.com/@oxen-ai/videos. –> Oxen is an excellent programmer
Reference
Hepta. “How to Judge RWKV (arXiv 2305.13048)?,” September 15, 2023. https://www.zhihu.com/question/602564718/answer/3211669817.
[Efficiently Modeling Long Sequences with Structured State Spaces - Albert Gu | Stanford MLSys #46 (youtube.com)](https://www.youtube.com/watch?v=EvQ3ncuriCM) |
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained) (youtube.com)
https://www.youtube.com/watch?v=8Q_tqwpTpVU&ab_channel=UmarJamil
https://www.youtube.com/watch?v=iskuX3Ak9Uk&ab_channel=TrelisResearch. Good!