C++ implementation for efficient local inference of Qwen models.

Library

GitHub Repository

efficiency

Notes

Repository archived on December 6, 2024.