
MarkushGrapher: Multi-modal Markush Structure Recognition
Multi-modal transformer combining vision, text, and layout encoding to extract complex Markush structures from patent …

Multi-modal transformer combining vision, text, and layout encoding to extract complex Markush structures from patent …

A deep learning model for Optical Chemical Structure Recognition (OCSR) using SwinV2 and GPT-2 to convert molecular …

A graph-based deep learning approach for optical chemical structure recognition that outperforms image captioning …

A vision-based deep learning framework that unifies molecule detection, reaction parsing, and OCSR for page-level …

OCSU task for translating molecular images into multi-level descriptions. Introduces Vis-CheBI20 dataset and …

Novel Ring-Free Language representation and Molecular Skeleton Decoder architecture for improved optical chemical …

Deep learning OCSR model using keypoint estimation to detect atom and bond centers for graph-based molecular structure …

Deep learning framework using CNN-LSTM image captioning to convert hand-drawn hydrocarbon structures into SMILES strings …

Transformer-based approach for Optical Chemical Structure Recognition converting chemical images to SELFIES strings with …

Vision Transformer encoder with Transformer decoder for molecular image-to-InChI translation, achieving state-of-the-art …

An end-to-end framework (RCGD) and unambiguous markup language (SSML) for recognizing complex handwritten chemical …

A Transformer-based model (ICMDT) for converting chemical structure images into InChI text strings using a novel Deep …