DeepSeek Coder V2 开源发布

7bzre4bg · 2024 年6 月 17 日 12:31

今天，全球首个在代码、数学能力上与GPT-4-Turbo争锋的模型，DeepSeek-Coder-V2，正式上线和开源。

lueluelue · 2024 年6 月 17 日 12:37

Silicon上能用吗

wo_zu_long · 2024 年6 月 17 日 12:38

挺强的，略微体验了一下确实逻辑能力不错

handsome · 2024 年6 月 17 日 12:42

多少钱啊

YanTeng_Duan · 2024 年6 月 17 日 12:44

这么牛

wo_zu_long · 2024 年6 月 17 日 12:44

一样的白菜，看测试集的分数，应该是可以当作4o api的替代在override上使用了

7bzre4bg · 2024 年6 月 17 日 12:46

Ollama估计过几天就上线了，现在还没有coder版本

7bzre4bg · 2024 年6 月 17 日 12:49

之前没出二代时候，始皇就推荐用deepseek-coder做代码补全

handsome · 2024 年6 月 17 日 12:50

卧槽，太香了

wo_zu_long · 2024 年6 月 17 日 12:50

这次是甚至可以替代chat了，而非只能补全

lueluelue · 2024 年6 月 17 日 12:52

这个coder也是236b啊

wo_zu_long · 2024 年6 月 17 日 12:54

尝试在三级区白嫖的英特尔试试？不过不支持cuda，不知道能部署不，还不如直接买官方的

zhong_little · 2024 年6 月 17 日 12:57

api 版本上下文还是 32k，期待上新 128k 版本

wo_zu_long · 2024 年6 月 17 日 12:58

This performance is notable as it breaks the dominance typically seen from closed-source models, standing out as a leading open-source contender. It is surpassed only by GPT-4o, which leads with an average score of 76.4%. DeepSeek-Coder-V2-Instruct shows top-tier results across a variety of languages, including the highest scores in Java and PHP, and strong performances in Python, C++, C#, TypeScript, and JavaScript, underscoring its robustness and versatility in handling diverse coding challenges.

lueluelue · 2024 年6 月 17 日 12:59

期待deepseek-math也更新！

wo_zu_long · 2024 年6 月 17 日 13:01

这次的coder-v2使用的训练方法与deepseek-math是相同的，可以说是math的整合版

To collect code-related and math-related web texts from Common Crawl, we follow the same pipeline as DeepSeekMath(Shao et al., 2024).

lueluelue · 2024 年6 月 17 日 13:04

谢谢！！！

wo_zu_long · 2024 年6 月 17 日 13:07

数学能力是相当不错的，虽然只有10%的数学语料

The pre-training data for DeepSeek-Coder-V2 primarily consists of 60% source code, 10% math corpus, and 30% natural language corpus.

The results, presented in Table 9, were obtained using greedy decoding without the aid of tools or voting techniques, unless otherwise specified. DeepSeek-Coder-V2 achieved an accuracy of 75.7% on the MATH benchmark and 53.7% on Math Odyssey, comparable to the state-of-the-art GPT-4o. Additionally, DeepSeek-Coder-V2 solves more problems from AIME2024 than the other models, demonstrating its strong mathematical reasoning capabilities.

可以看看deepseek会不会用v2架构更新math模型

lueluelue · 2024 年6 月 17 日 13:21

太好了，高考题启动！

~~可能拿高考题训练了hhhh~~

bbb · 2024 年6 月 17 日 13:23

支持开源！

话题		回复	浏览量
241121 三花AI日报：OpenAI 的 GPT-4o 模型重夺竞技场榜首；DeepSeek 推出全新推理模型 R1-Lite 预览版；谷歌推出专为教育研究微调的 AI 模型前沿快讯人工智能	9	792	2024 年11 月 21 日
250326 三花 AI 日报：OpenAI 推出了 GPT-4o 的图像生成功能；谷歌 Gemini 2.5 Pro 实验版发布；DeepSeek-V3 非推理模型首次登顶排行榜前沿快讯人工智能	30	1184	2025 年3 月 26 日
DeepSeek 放出超重磅全新模型！ - 新模型直接对标 OpenAI 搞七捻三人工智能	41	1040	2024 年11 月 21 日
新版 V3 模型的百科知识（MMLU-Pro, GPQA）、数学（MATH-500, AIME 2024）和代码任务（LiveCodeBench）表现均有提升前沿快讯	3	173	2025 年3 月 27 日
Deepseek V3 0324官方readme来了开发调优人工智能	7	690	2025 年3 月 25 日

DeepSeek Coder V2 开源发布

相关话题