Open source repo
omlx
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
Visit resourceWhy it is on 0CAP
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar