High-performance toolkit for compressing, deploying, and serving LLMs at scale.

Library

efficiencyframework

Notes

Date approximate.