Question 1

What is ztxz16/fastllm?

Accepted Answer

A backend-independent, C++-based high-performance large language model inference library supporting dense and MoE architectures with tensor parallelism and FP8/INT4 quantization.

Question 2

Is fastllm open source?

Accepted Answer

Yes — ztxz16/fastllm is open source, released under the Apache-2.0 license.

Question 3

What language is fastllm written in?

Accepted Answer

ztxz16/fastllm is primarily written in C++.

Question 4

How popular is fastllm?

Accepted Answer

ztxz16/fastllm has 4.9k stars on GitHub.

Question 5

Where can I find fastllm?

Accepted Answer

ztxz16/fastllm is on GitHub at https://github.com/ztxz16/fastllm.

Frequently asked