Is ReplitLM open source?

Yes — replit/ReplitLM is open source, released under the Apache-2.0 license.

What language is ReplitLM written in?

replit/ReplitLM is primarily written in Python.

How popular is ReplitLM?

replit/ReplitLM has 1.1k stars on GitHub.

Where can I find ReplitLM?

replit/ReplitLM is on GitHub at https://github.com/replit/ReplitLM.

← all repositories

replit/ReplitLM

Replit's code model: the docs are the product

A 3B-parameter coding LLM whose main artifact is a very thorough README on how to train it elsewhere.

★1.1k stars Python Language Models Coding Assistants

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does ReplitLM is the companion repo for Replit’s replit-code-v1-3b, a 3-billion-parameter language model trained for code completion. The repository doesn’t contain the model weights—those live on Hugging Face—but instead provides configuration snippets, tokenizer handling notes, and step-by-step guides for inference, fine-tuning, and instruction tuning. Think of it as the operator’s manual.

The interesting bit The README is refreshingly honest about the friction involved. It documents specific workarounds: you must patch LLM Foundry’s data-prep script to pass trust_remote_code=True, and you must manually null out model.tokenizer before checkpointing because Composer chokes on tokenizers that include *.py files. Most projects hide this kind of duct tape; Replit puts it in bold with warning emoji.

Key highlights

Model weights, vocab, and code are under three different licenses (CC BY-SA 4.0, CC BY-SA 4.0, and Apache 2.0 respectively)
Hosted GPU demo available via Hugging Face Spaces
Recommended training stack is MosaicML’s LLM Foundry + Composer, not a custom Replit pipeline
Includes Alpaca-style instruction tuning guide using a community fork (stanford_alpaca-replit) by contributor Teknium
replit-code-v1_5-3b is listed as “Coming Soon” with no date

Caveats

The README is the entire product; this repo is essentially documentation and configuration glue
Training requires editing upstream dependencies (LLM Foundry, Composer) to work around known incompatibilities
No benchmarks, evaluation results, or comparison tables are provided in the sources

Verdict Grab this if you’re already committed to the MosaicML stack and want a head start on fine-tuning a mid-size code model. Skip it if you’re looking for drop-in weights, training code you can run untouched, or any performance claims to justify the hardware spend.

Frequently asked

What is replit/ReplitLM?: A 3B-parameter coding LLM whose main artifact is a very thorough README on how to train it elsewhere.
Is ReplitLM open source?: Yes — replit/ReplitLM is open source, released under the Apache-2.0 license.
What language is ReplitLM written in?: replit/ReplitLM is primarily written in Python.
How popular is ReplitLM?: replit/ReplitLM has 1.1k stars on GitHub.
Where can I find ReplitLM?: replit/ReplitLM is on GitHub at https://github.com/replit/ReplitLM.