← all repositories
meta-llama/llama3

Meta's Llama 3 repo is now a redirect signpost

The official Llama 3 repository has been deprecated and split into five successor projects as part of the Llama 3.1 "Llama Stack" expansion.

29.3k stars Python Language Models
llama3
Velocity · 7d
+36
★ / day
Trend
steady
star history

What it does

This repository once housed the official reference implementation for loading and running Meta’s Llama 3 models (8B and 70B parameters). It provided minimal inference scripts, a tokenizer with a specific chat format, and a download.sh script for fetching weights via signed URLs. Now it functions primarily as a deprecation notice pointing users to newer repositories.

The interesting bit

Meta’s model release strategy has shifted from monolithic repos to a distributed “Llama Stack” — models, safety tools, toolchain interfaces, agentic systems, and community recipes now live in separate repositories. The README itself is now mostly a map to these five successor projects.

Key highlights

  • Supports 8B (single GPU) and 70B (8-GPU model-parallel) parameter variants
  • Maximum sequence length of 8192 tokens, with cache pre-allocation tuned via max_seq_len and max_batch_size
  • Instruction-tuned models require strict ChatFormat token formatting (<|begin_of_text|>, role tags, <|eot_id|>)
  • Weights accessible via Meta’s approval-gated download URLs (24-hour expiry) or Hugging Face with license acceptance
  • Includes safety boilerplate: links to PurpleLlama, Responsible Use Guide, and output feedback mechanisms

Caveats

  • Repository is explicitly deprecated; active development moved to llama-models, llama-toolchain, and others
  • Download links expire after 24 hours and have download count limits
  • README warns that “testing conducted to date has not — and could not — cover all scenarios”

Verdict

Worth a quick visit if you’re tracing Meta’s repository evolution or need the exact original Llama 3 reference code. Everyone else should head directly to the successor repos, particularly llama-models for weights and llama-cookbook for practical recipes.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.