esbatmop/MNBVC
4,123 stars · 287 forks · MIT license

MNBVC

Features
Datasets and Corpora - Massive, continuously updated Chinese language corpus.
Pre-training Datasets - Massive, diverse Chinese text corpus from internet sources.