togethercomputer/RedPajama-Data (Python, Apache-2.0)

RedPajama Data

Features

Data Resources - Large-scale dataset for pretraining language models.