Computational Chemistry
Chemical diagram showing a generalized Grignard reaction

RInChI: Reaction International Chemical Identifier

RInChI extends InChI to create unique, machine-readable identifiers for chemical reactions and database searching.

Computational Chemistry
SELFIES molecular representation overview

SELFIES: The Original Paper (Krenn et al. 2020)

The 2020 paper introducing SELFIES, the 100% robust molecular representation that solves SMILES validity problems in ML …

Computational Chemistry
Benzene molecular structure diagram

SMILES: The Original Paper (Weininger 1988)

Weininger's 1988 paper introducing SMILES notation, the string-based molecular representation that revolutionized …

Computational Chemistry
GTR-CoT: Graph Traversal Chain-of-Thought for Molecules

GTR-CoT: Graph Traversal Chain-of-Thought for Molecules

GTR-CoT uses graph traversal chain-of-thought reasoning to improve optical chemical structure recognition accuracy.

Computational Chemistry
Optical chemical structure recognition example

MolRec: Chemical Structure Recognition at CLEF 2012

MolRec achieves 95%+ accuracy on simple structures but struggles with complex diagrams, revealing rule-based OCSR limits …

Computational Chemistry
Optical chemical structure recognition example

MolRec: Rule-Based OCSR System

Rule-based system for optical chemical structure recognition using vectorization and geometric analysis, achieving 95% …

Computational Chemistry
Markush structure diagram

SubGrapher: Visual Fingerprinting of Chemical Structures

Novel OCSR method creating molecular fingerprints from images through functional group segmentation for database …

Computational Chemistry
The transformation from a 2D chemical structure image to a SMILES representation

What is Optical Chemical Structure Recognition (OCSR)?

A micro-review of Optical Chemical Structure Recognition (OCSR), covering rule-based systems to modern deep learning …

Computational Chemistry
αExtractor extracts structured chemical information from biomedical literature

αExtractor: Chemical Info from Biomedical Literature

αExtractor uses ResNet-Transformer to extract chemical structures from literature images, including noisy and hand-drawn …

Computational Chemistry
ChemInfty: Chemical Structure Recognition in Patent Images

ChemInfty: Chemical Structure Recognition in Patent Images

Fujiyoshi et al.'s segment-based approach for recognizing chemical structures in challenging Japanese patent images with …

Computational Chemistry

MolNexTR: Dual-Stream Molecular Image Recognition

Dual-stream encoder combining ConvNext and ViT for robust optical chemical structure recognition across diverse …

Computational Chemistry
A colored molecule with annotations, representing the diverse drawing styles found in scientific papers that OCSR models must handle.

MolParser-7M & WildMol: Large-Scale OCSR Datasets

MolParser-7M is the largest OCSR dataset with 7.7M image-text pairs of molecules and E-SMILES, including 400k real-world …