Open source repo

airllm

AirLLM 70B inference with single 4GB GPU

Visit resource

Why it is on 0CAP

AirLLM 70B inference with single 4GB GPU