Deepseek V3 Python Tutorial

llm-paper-tutorials /papers /A1-deepseek-v3

We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-effective training, ...

NextBigFuture

DeepSeek Local Model Testing and R1 Tutorial

The Opensource DeepSeek R1 model and the distilled local versions are shaking up the AI community. The Deepseek models are the best performing open source models and are highly useful as agents and ...

Mashable

DeepSeek v3.2: What's new and how does it compare to ChatGPT?

Remember DeepSeek, the large language model (LLM) out of China that was released for free earlier this year and upended the AI industry? Without the funding and infrastructure of leaders in the space ...

TWCN Tech News

How to use DeepSeek V3 Coder in Windows 11?

If you want to learn how to use DeepSeek V3 Coder in Windows 11, this post will guide you. DeepSeek-V3 Coder is a specialized version of the DeepSeek-V3 model. It leverages natural language processing ...

scmp.com

China’s DeepSeek challenges Google DeepMind and OpenAI with new AI model

Artificial intelligence start-up DeepSeek has unveiled its most powerful model variant, DeepSeek-V3.2-Speciale, which is said to match Google DeepMind’s new Gemini 3 Pro model in certain tasks, ...

Gizmochina

DeepSeek Releases V3.1 Model: What’s New?

Chinese AI company DeepSeek has released version 3.1 of its flagship large language model, expanding the context window to 128,000 tokens and increasing the parameter count to 685 billion. The update ...

VentureBeat

DeepSeek's new V3.2-Exp model cuts API pricing in half to less than 3 cents per 1M input tokens

DeepSeek continues to push the frontier of generative AI...in this case, in terms of affordability. The company has unveiled its latest experimental large language model (LLM), DeepSeek-V3.2-Exp, that ...

GitHub

llm_text_detection_deepseek_api_tutorial

一个面向初学者的 DeepSeek API 文本分类教学项目。项目围绕 Kaggle 数据集 prajwaldongre/llm-detect-ai-generated-vs-student-generated-text ...

InfoQ

DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks

DeepSeek released DeepSeek-V3.2, a family of open-source reasoning and agentic AI models. The high compute version, DeepSeek-V3.2-Speciale, performs better than GPT-5 and comparably to Gemini-3.0-Pro ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results