salesforce/xgen
A 7B parameter open-source large language model family supporting up to 8K input sequence length, released by Salesforce AI Research.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
XGen provides three model variants: XGen-7B-4K-Base, XGen-7B-8K-Base, and XGen-7B-8K-Inst (instruction-finetuned). The models are trained to handle long sequences up to 8K tokens and can be loaded via the HuggingFace transformers library for auto-regressive text generation. They use the OpenAI Tiktoken tokenizer for subword tokenization.