Open source repo

olmocr

Toolkit for linearizing PDFs for LLM datasets/training

Visit resource

Why it is on 0CAP

Toolkit for linearizing PDFs for LLM datasets/training