Hand-drawn chemical structure recognition addresses the challenge of interpreting molecules sketched by hand, whether photographed from paper, scanned from notebooks, or captured from whiteboard images. These inputs differ substantially from the clean, typeset diagrams that most OCSR systems target: strokes vary in thickness and regularity, characters are ambiguous, and bond angles are imprecise. Early work like Ramel (1999) and Ouyang & Davis (2007) applied structural analysis and domain-constrained sketch recognition. More recent approaches use deep learning, including DECIMER’s hand-drawn variant, ChemReco’s EfficientNet-Transformer pipeline, and data augmentation strategies like OCSAug’s diffusion-based pipeline to bridge the domain gap between synthetic training data and real hand-drawn inputs.

YearPaperKey Idea
1999Structural Analysis of Handwritten Chemical FormulasStructural graph representation with recursive specialists
2007Hand-Drawn Chemical Diagram Recognition (AAAI 2007)Sketch recognition using chemical valence to correct errors
2008Handwritten Chemical Ring Recognition with Neural NetworksTwo-phase neural network pipeline for 23 types of heterocyclic rings
2021ChemPix: Hand-Drawn Hydrocarbon Structure RecognitionCNN-LSTM image captioning for hand-drawn hydrocarbons to SMILES
2023Handwritten Chemical Structure Recognition with RCGDEnd-to-end framework with guided graph traversal and SSML markup
2024AtomLenz: Atom-Level OCSR with Limited SupervisionWeakly supervised object detection and graph construction for OCSR
2024ChemReco: Hand-Drawn Chemical Structure RecognitionEfficientNet and Transformer achieving 96.9% accuracy
2024Enhanced DECIMER for Hand-Drawn Structure RecognitionEfficientNetV2 + Transformer for hand-drawn structures to SMILES
2025OCSAug: Diffusion-Based Augmentation for Hand-Drawn OCSRDDPM and RePaint augmentation to bridge synthetic-to-real domain gap