DeepSeek
@deepseek-aiOrganizationOn the leaderboard
| Rank | Repository | Stars |
|---|---|---|
| 94 | deepseek-ai/DeepSeek-V3 | 102,477 |
| 119 | deepseek-ai/DeepSeek-R1 | 91,959 |
| 717 | deepseek-ai/awesome-deepseek-integration | 36,153 |
Top repositories by stars
- deepseek-ai/DeepSeek-V3(on leaderboard)Python101,608
- deepseek-ai/DeepSeek-R1(on leaderboard)91,844
- deepseek-ai/awesome-deepseek-integration(on leaderboard)
Integrate the DeepSeek API into popular softwares
35,436 - deepseek-ai/DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Python22,794 - deepseek-ai/DeepSeek-OCR
Contexts Optical Compression
Python22,479 - deepseek-ai/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Python17,707 - deepseek-ai/FlashMLA
FlashMLA: Efficient Multi-head Latent Attention Kernels
C++12,492 - deepseek-ai/3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++9,709 - deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda8,991 - deepseek-ai/open-infra-index
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
7,967 - deepseek-ai/DeepSeek-LLM
DeepSeek LLM: Let there be answers
Makefile6,732 - deepseek-ai/DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
6,457 - deepseek-ai/DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda6,183 - deepseek-ai/DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Python5,228 - deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
4,994 - deepseek-ai/smallpond
A lightweight data processing framework built on DuckDB and 3FS.
Python4,922 - deepseek-ai/DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Python4,071 - deepseek-ai/Engram
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Python3,701 - deepseek-ai/DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Python3,162 - deepseek-ai/DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Python3,004 - deepseek-ai/DualPipe
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
Python2,919 - deepseek-ai/DeepSeek-OCR-2
Visual Causal Flow
Python2,289 - deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Python1,894 - Python1,546
- Python1,480
- deepseek-ai/EPLB
Expert Parallelism Load Balancer
Python1,346 - deepseek-ai/profile-data
Analyze computation-communication overlap in V3/R1.
1,144 - deepseek-ai/awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
768 - deepseek-ai/ESFT
Expert Specialized Fine-Tuning
Python729 - Python552
- deepseek-ai/LPLB
An early research stage expert-parallel load balancer for MoE models based on linear programming.
Python497