Open source repo

omlx

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Why it is on 0CAP

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar