Question 1

What is microsoft/MInference?

Accepted Answer

MInference is a sparse attention kernel that accelerates long-context LLM inference by up to 10x on A100 GPUs.

Question 2

Is MInference open source?

Accepted Answer

Yes — microsoft/MInference is open source, released under the MIT license.

Question 3

What language is MInference written in?

Accepted Answer

microsoft/MInference is primarily written in Python.

Question 4

How popular is MInference?

Accepted Answer

microsoft/MInference has 1.2k stars on GitHub.

Question 5

Where can I find MInference?

Accepted Answer

microsoft/MInference is on GitHub at https://github.com/microsoft/MInference.

Frequently asked