DeepSeek
https://deepseek.com
Chinese AI research lab known for highly efficient open-source models. Its DeepSeek-V3 model achieves top performance at a fraction of the typical training cost.
Models by DeepSeek (2)
DeepSeek-V3
128K ctx
Highly efficient 671B-parameter MoE model trained on 14.8T tokens. Achieves top benchmark scores at a fraction of the typical training cost.
Research, Code, Complex
DeepSeek-R1
64K ctx
Reasoning-focused model achieving strong performance on math and coding benchmarks through reinforcement learning.
Code, Complex