C++ implementation for efficient local inference of Qwen models.

Library

efficiency

Notes

Repository archived on December 6, 2024.