← all repositories

microsoft/Llama-2-Onnx

Microsoft provides optimized ONNX-format versions of Meta's Llama 2 language models for efficient cross-platform inference.

Llama-2-Onnx
Velocity · 7d
+1.0
★ / day
Trend
steady
star history

This repository hosts ONNX (Open Neural Network Exchange) optimized versions of Meta’s Llama 2 language models. It provides multiple variants across different model sizes (7B, 13B) and precision formats (float16, float32, fine-tuned). The ONNX format enables efficient cross-platform inference serving with hardware acceleration.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.