Skip to main content
  • NPU acceleration is Windows-only. GPU/CPU work cross-platform.
  • State is in-memory (Python) — resets on server restart.
  • First model download is 2–4 GB.
  • Single-user only — no multi-user support.