Stars
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models