Replit's code model: the docs are the product
A 3B-parameter coding LLM whose main artifact is a very thorough README on how to train it elsewhere.

What it does
ReplitLM is the companion repo for Replit’s replit-code-v1-3b, a 3-billion-parameter language model trained for code completion. The repository doesn’t contain the model weights—those live on Hugging Face—but instead provides configuration snippets, tokenizer handling notes, and step-by-step guides for inference, fine-tuning, and instruction tuning. Think of it as the operator’s manual.
The interesting bit
The README is refreshingly honest about the friction involved. It documents specific workarounds: you must patch LLM Foundry’s data-prep script to pass trust_remote_code=True, and you must manually null out model.tokenizer before checkpointing because Composer chokes on tokenizers that include *.py files. Most projects hide this kind of duct tape; Replit puts it in bold with warning emoji.
Key highlights
- Model weights, vocab, and code are under three different licenses (CC BY-SA 4.0, CC BY-SA 4.0, and Apache 2.0 respectively)
- Hosted GPU demo available via Hugging Face Spaces
- Recommended training stack is MosaicML’s LLM Foundry + Composer, not a custom Replit pipeline
- Includes Alpaca-style instruction tuning guide using a community fork (
stanford_alpaca-replit) by contributor Teknium replit-code-v1_5-3bis listed as “Coming Soon” with no date
Caveats
- The README is the entire product; this repo is essentially documentation and configuration glue
- Training requires editing upstream dependencies (LLM Foundry, Composer) to work around known incompatibilities
- No benchmarks, evaluation results, or comparison tables are provided in the sources
Verdict Grab this if you’re already committed to the MosaicML stack and want a head start on fine-tuning a mid-size code model. Skip it if you’re looking for drop-in weights, training code you can run untouched, or any performance claims to justify the hardware spend.