Document Processing

LLMs for Page Stream Segmentation

We enhance the TABME benchmark for page stream segmentation, creating TABME++, and show that fine-tuned, decoder-based …...