Document Processing
Diagram showing page stream segmentation workflow: an input stream of pages is processed through binary classification of page pairs to predict document breaks, producing segmented output documents

LLMs for Page Stream Segmentation

Enhanced TabMe benchmark for page stream segmentation, creating TabMe++, showing fine-tuned decoder-based LLMs …...

Computational Chemistry

ChemInfty: Robust Segmentation and Recognition of Chemical Structures in Low-Quality Patent Images

Fujiyoshi et al.'s segment-based approach for recognizing chemical structures in challenging Japanese patent images with …...