#redpajama
Articles with this tag
Key Highlights: Red Pajama 2 is an open-source language model pre-training dataset containing a massive 30 trillion tokens, making it the largest public dataset...