
Image-to-Sequence OCSR: A Comparative Analysis
Comparative analysis of image-to-sequence OCSR methods across architecture, output format, training data, and compute …

Comparative analysis of image-to-sequence OCSR methods across architecture, output format, training data, and compute …

A multimodal search engine that integrates text passages, molecular diagrams, and reaction data to enable passage-level …

A diffusion-based data augmentation pipeline (OCSAug) using DDPM and RePaint to improve optical chemical structure …

Weakly supervised OCSR framework combining object detection and graph construction to recognize chemical structures from …

A deep learning method using EfficientNet and Transformer to convert hand-drawn chemical structures into SMILES codes, …

A 26B parameter multimodal LLM for chemistry, combining InternViT-6B and ChemLLM-20B for molecular structure …

Benchmark of 8 open-access OCSR methods on 2702 manually curated patent images, with ChemIC classifier for hybrid …

Open-source OCSR platform combining Mask R-CNN segmentation and Transformer recognition, trained on 450M+ synthetic …

A Transformer-based OCSR model introducing dual-path modules (CGFE and SDGLA) to improve global context awareness and …

An improved encoder-decoder model (EfficientNetV2 + Transformer) for converting hand-drawn chemical structures into …

Deep learning model using improved SwinTransformer encoder and attention-based feature fusion to convert molecular …

Multi-modal transformer combining vision, text, and layout encoding to extract complex Markush structures from patent …