1 repo
Access points for pre-trained model parameters.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Model Weights.
DeepSeek-V3 is a large language model whose repository provides resources for using the model, including technical specifications, pre-trained weights, and evaluation benchmarks. The project details the core transformer architecture, including parameter counts and multi-token prediction modules, while supporting na