Computational Chemistry
DECIMER 1.0: Transformers for Chemical Image Recognition

DECIMER 1.0: Transformers for Chemical Image Recognition

Transformer-based approach for Optical Chemical Structure Recognition converting chemical images to SELFIES strings with …

Computational Chemistry
End-to-End Transformer for Molecular Image Captioning

End-to-End Transformer for Molecular Image Captioning

Vision Transformer encoder with Transformer decoder for molecular image-to-InChI translation, achieving state-of-the-art …

Computational Chemistry
ICMDT: Automated Chemical Image Recognition with Deep TNT

ICMDT: Automated Chemical Image Recognition

A Transformer-based model (ICMDT) for converting chemical structure images into InChI text strings using a novel Deep …

Computational Chemistry

Image2SMILES: Transformer OCSR with Synthetic Data Pipeline

Transformer-based OCSR using a novel synthetic data generation pipeline for robust molecular image interpretation across …

Computational Chemistry

String Representations for Chemical Image Recognition

Ablation study comparing SMILES, DeepSMILES, SELFIES, and InChI for OCSR. SMILES achieves highest accuracy; SELFIES …

Computational Chemistry

SwinOCSR: Vision Transformers for Chemical OCR

Deep learning model using Swin Transformer and Focal Loss for OCSR, achieving 98.58% accuracy on synthetic benchmarks.

Computational Chemistry
Optical chemical structure recognition example

IMG2SMI: Translating Molecular Structure Images to SMILES

Campos & Ji's method for converting 2D molecular images to SMILES strings using Transformers and SELFIES representation.

Computational Chemistry
GTR-CoT: Graph Traversal Chain-of-Thought for Molecules

GTR-CoT: Graph Traversal Chain-of-Thought for Molecules

GTR-CoT uses graph traversal chain-of-thought reasoning to improve optical chemical structure recognition accuracy.

Computational Chemistry
αExtractor extracts structured chemical information from biomedical literature

αExtractor: Chemical Info from Biomedical Literature

αExtractor uses ResNet-Transformer to extract chemical structures from literature images, including noisy and hand-drawn …

Computational Chemistry
A colored molecule with annotations, representing the diverse drawing styles found in scientific papers that OCSR models must handle.

MolParser-7M & WildMol: Large-Scale OCSR Datasets

MolParser-7M is the largest OCSR dataset with 7.7M image-text pairs of molecules and E-SMILES, including 400k real-world …

Computational Chemistry
Optical chemical structure recognition example

MolParser: End-to-End Molecular Structure Recognition

MolParser converts molecular images from scientific documents to machine-readable formats using end-to-end learning with …

Computational Chemistry
Adaptive grid merging visualization for benzene molecule showing multi-resolution spatial discretization

Beyond Atoms: 3D Space Modeling for Molecular Pretraining

Lu et al. introduce SpaceFormer, a Transformer that models entire 3D molecular space (not just atoms) for superior …