Kernel library for high-speed LLM serving on PaddlePaddle.

Library

GitHub Repository

infrastructureefficiency

Notes

Date approximate.