Showing projects tagged with "qwen3".
Unified AI inference SDK built on a custom NexaML engine, with Day-0 model support across NPU, GPU, and CPU and compatibility with the GGUF, MLX, and .nexa formats.
Native macOS (Swift/SwiftUI) local LLM chat interface with RAG, function calling, deep research agents, and privacy-first offline processing.