Mistral

Mistral AI focuses on efficiency and coding capabilities. Their "Mixture of Experts" (MoE) architecture (Mixtral) changed the game.

When to Use


  • Coding: Codestral is purpose-built for code generation; Mistral Large 2 is also strong at coding.
  • Efficiency: Mixtral 8x7B offers GPT-3.5+ performance at a fraction of the inference cost.
  • Open Weights: Apache 2.0 licenses (for smaller models).
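For the hosted models, a minimal sketch against la Plateforme's chat completions endpoint. The endpoint path, the `mistral-large-latest` alias, and the OpenAI-style response shape (`choices[0].message.content`) are assumptions to verify against the current API reference:

```python
import json
import os
import urllib.request

API_URL = "https://api.mistral.ai/v1/chat/completions"  # assumed endpoint path

def build_chat_request(prompt: str, model: str = "mistral-large-latest") -> dict:
    """Build the JSON body for Mistral's OpenAI-style chat endpoint."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,  # low temperature suits deterministic tasks like coding
    }

def chat(prompt: str) -> str:
    """Send the request; expects MISTRAL_API_KEY in the environment."""
    body = json.dumps(build_chat_request(prompt)).encode()
    req = urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

Mistral also ships an official Python client; the raw-HTTP version above just makes the request shape explicit.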

Core Concepts


MoE (Mixture of Experts)


Only a subset of the parameters (the experts) is active for each token, giving high quality at low inference cost.
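A toy NumPy sketch of top-k routing (illustrative only, not Mixtral's actual implementation): a gate scores every expert, but only the best two run per token, which is where the compute saving comes from. The shapes mirror Mixtral's 2-of-8 routing.

```python
import numpy as np

rng = np.random.default_rng(0)

def moe_layer(x, experts_w, gate_w, top_k=2):
    """Toy Mixture-of-Experts layer: route each token to its top-k experts.

    x:         (tokens, d_in)            token activations
    experts_w: (n_experts, d_in, d_out)  one weight matrix per expert
    gate_w:    (d_in, n_experts)         router weights
    """
    logits = x @ gate_w                           # (tokens, n_experts) gate scores
    top = np.argsort(logits, axis=1)[:, -top_k:]  # indices of the k best experts
    out = np.zeros((x.shape[0], experts_w.shape[2]))
    for t in range(x.shape[0]):
        sel = logits[t, top[t]]
        w = np.exp(sel - sel.max())
        w /= w.sum()                              # softmax over the selected experts only
        for weight, e in zip(w, top[t]):
            out[t] += weight * (x[t] @ experts_w[e])  # only k experts ever compute
    return out

x = rng.standard_normal((4, 8))            # 4 tokens, hidden size 8
experts = rng.standard_normal((8, 8, 16))  # 8 experts (cf. Mixtral's 8)
gate = rng.standard_normal((8, 8))
y = moe_layer(x, experts, gate)            # each token used only 2 of 8 experts
```

With 2 of 8 experts active, each token touches roughly a quarter of the expert parameters, yet the router can still draw on the full pool across tokens.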

Codestral


A model trained specifically on 80+ programming languages.
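Codestral's fill-in-the-middle (FIM) mode is what editor integrations typically use: the model completes code between a prefix and a suffix. The sketch below builds such a request; the `/v1/fim/completions` path, the `prompt`/`suffix` field names, and the `codestral-latest` alias are assumptions based on Mistral's docs, so check the current reference.

```python
FIM_URL = "https://api.mistral.ai/v1/fim/completions"  # assumed endpoint path

def build_fim_request(prefix: str, suffix: str,
                      model: str = "codestral-latest") -> dict:
    """Body for a fill-in-the-middle request: the model generates the
    code that belongs between `prompt` (prefix) and `suffix`."""
    return {
        "model": model,
        "prompt": prefix,     # code before the cursor
        "suffix": suffix,     # code after the cursor
        "max_tokens": 64,
    }

# Ask the model to fill in the body of a function:
body = build_fim_request(
    prefix="def fib(n):\n    ",
    suffix="\n    return a",
)
```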

Le Chat


Mistral's chat interface (chat.mistral.ai).

Best Practices (2025)


Do:
  • Use codestral-mamba for coding tasks that need a very long (theoretically unbounded) context window; its Mamba architecture runs in linear time.
  • Deploy via vLLM: Mistral models run exceptionally well on vLLM.
Don't:
  • Don't ignore small models: Mistral NeMo (12B) is surprisingly capable for RAG.
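A deployment fragment for the vLLM route; the model ID and flags are illustrative (the exact Hugging Face repo name and supported options depend on your vLLM version), so verify against the vLLM docs:

```shell
# Serve a Mistral model behind an OpenAI-compatible HTTP API.
pip install vllm

vllm serve mistralai/Mistral-Nemo-Instruct-2407 \
  --max-model-len 16384 \
  --port 8000

# Then query it like any OpenAI-style endpoint:
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "mistralai/Mistral-Nemo-Instruct-2407",
       "messages": [{"role": "user", "content": "Hello"}]}'
```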
