
LLMs for Page Stream Segmentation
Enhanced TabMe benchmark for page stream segmentation, creating TabMe++, showing fine-tuned decoder-based LLMs …

An open-source framework integrating DeepChem and Ray for training and benchmarking chemical foundation models like …

A 14B-parameter chemical reasoning LLM enhanced with atomized functional group knowledge and mix-sourced distillation …

Optimizing transformer pretraining for molecules using MLM vs MTR objectives, scaling to 77M compounds from PubChem for …

A systematic evaluation of RoBERTa transformers pretrained on 77M PubChem SMILES for molecular property prediction …

Vision-language pipeline extracting chemical reaction data from PDF figures and tables into structured knowledge graphs …

A deep learning method using EfficientNet and Transformer to convert hand-drawn chemical structures into SMILES codes, …

Benchmark of 8 open-access OCSR methods on 2702 manually curated patent images, with ChemIC classifier for hybrid …

Open-source OCSR platform combining Mask R-CNN segmentation and Transformer recognition, trained on 450M+ synthetic …

A Transformer-based OCSR model introducing dual-path modules (CGFE and SDGLA) to improve global context awareness and …

An improved encoder-decoder model (EfficientNetV2 + Transformer) for converting hand-drawn chemical structures into …

A deep learning model for Optical Chemical Structure Recognition (OCSR) using SwinV2 and GPT-2 to convert molecular …