Lemonade
High-performance local LLM server with GPU and NPU acceleration support, featuring multiple inference engines, OpenAI-compatible API, and cross-platform model deployment for AMD Ryzen AI processors.
⭐ 1,845lemonade-sdk
amdllamallmllm-inference