Is Qwen3 open source?

Yes — QwenLM/Qwen3 is an open-source project tracked on heatdrop.

What language is Qwen3 written in?

QwenLM/Qwen3 is primarily written in Python.

How popular is Qwen3?

QwenLM/Qwen3 has 27.4k stars on GitHub.

Where can I find Qwen3?

QwenLM/Qwen3 is on GitHub at https://github.com/QwenLM/Qwen3.

← all repositories

QwenLM/Qwen3

Alibaba's Qwen3 splits the difference: think hard, or don't

A family of open-weight LLMs that ships separate 'thinking' and 'non-thinking' variants, plus a 1-million-token context window for the patient.

★27.4k stars Python Language Models Inference · Serving LLMOps · Eval

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does Qwen3 is Alibaba Cloud’s flagship open-weight LLM series, ranging from a 0.6B-parameter edge model up to a 235B-A22B MoE beast. The latest Qwen3-2507 release (July 2025) splits each size into two distinct variants: an Instruct model for fast, general-purpose chat and a Thinking model that reasons step-by-step before answering. Both handle 256K tokens out of the box, with a 1M-token mode available for the largest checkpoints.

The interesting bit Rather than a single model with a toggle, Qwen3-2507 commits to architecture: Instruct models literally cannot generate thinking blocks, and Thinking models always do. The previous release let you flip enable_thinking on the same weights; the new approach treats reasoning depth as a product decision, not a runtime flag. It’s a bet that users know which mode they need before they load the GPU.

Key highlights

Dense and MoE architectures from 0.6B to 235B-A22B parameters
Separate Instruct and Thinking checkpoints; no mode-switching within a model
256K context standard, 1M tokens supported on 235B-A22B and 30B-A3B variants
100+ languages and dialects with multilingual instruction following
Integrates with the usual stack: Transformers, vLLM, SGLang, llama.cpp, Ollama, LM Studio, plus quantization tools (GPTQ, AWQ, GGUF)
Training recipes for SFT and RLHF (marked TODO) via Axolotl and LLaMA-Factory

Caveats

The Qwen3-2507 evaluation blog is listed as “coming soon”; benchmark claims are currently unverified in the README
Thinking models consume significantly more output tokens—32K generation limits are recommended, not 16K
RLHF training guidance remains incomplete

Verdict Worth a look if you want open weights with a clear reasoning/non-reasoning split and serious long-context ambition. Skip if you need a single model that adapts its depth on the fly—that was the old Qwen3-2504, not this one.

Frequently asked

What is QwenLM/Qwen3?: A family of open-weight LLMs that ships separate 'thinking' and 'non-thinking' variants, plus a 1-million-token context window for the patient.
Is Qwen3 open source?: Yes — QwenLM/Qwen3 is an open-source project tracked on heatdrop.
What language is Qwen3 written in?: QwenLM/Qwen3 is primarily written in Python.
How popular is Qwen3?: QwenLM/Qwen3 has 27.4k stars on GitHub.
Where can I find Qwen3?: QwenLM/Qwen3 is on GitHub at https://github.com/QwenLM/Qwen3.