
PubMed-OCR: PMC Open Access OCR Annotations
A large-scale dataset of 209K+ articles with OCR and layout bounding boxes, enabling layout-aware modeling and document …

A large-scale dataset of 209K+ articles with OCR and layout bounding boxes, enabling layout-aware modeling and document …

A 14B-parameter chemical reasoning LLM enhanced with atomized functional group knowledge and mix-sourced distillation …

A systematic evaluation of RoBERTa transformers pretrained on 77M PubChem SMILES for molecular property prediction …

Vision-language pipeline extracting chemical reaction data from PDF figures and tables into structured knowledge graphs …

Weakly supervised OCSR framework combining object detection and graph construction to recognize chemical structures from …

A deep learning method using EfficientNet and Transformer to convert hand-drawn chemical structures into SMILES codes, …

Open-source OCSR platform combining Mask R-CNN segmentation and Transformer recognition, trained on 450M+ synthetic …

A graph-based deep learning approach for optical chemical structure recognition that outperforms image captioning …

OCSU task for translating molecular images into multi-level descriptions. Introduces Vis-CheBI20 dataset and …

An end-to-end framework (RCGD) and unambiguous markup language (SSML) for recognizing complex handwritten chemical …
Deep learning OCSR tool using YOLOv5 and MobileNetV2 to extract machine-readable molecular structures from scientific …
Deep learning model using Swin Transformer and Focal Loss for OCSR, achieving 98.58% accuracy on synthetic benchmarks.